Technical Field

The present invention is concerned with mutations in 
proteins having biological functions as ion channels and, 
more particularly, with such mutations where they are 
associated with diseases such as epilepsy and disorders 
associated with ion channel dysfunction including, but not 
restricted to, hyper- or hypo-kalemic periodic paralysis, 
myotonias, malignant hyperthermia, myasthenia, cardiac
arrhythmias, episodic ataxia, migraine, Alzheimer's
disease, Parkinson's disease, schizophrenia,
hyperekplexia, anxiety, depression, phobic obsessive
symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night 
blindness and total colour-blindness. 

Background Art

Epilepsies constitute a diverse collection of brain 
disorders that affect about 3% of the population at some 
time in their lives (Annegers, 1996). An epileptic seizure
can be defined as an episodic change in behaviour caused
by the disordered firing of populations of neurons in the
central nervous system. This results in varying degrees of
involuntary muscle contraction and often a loss of
consciousness. Epilepsy syndromes have been classified
into more than 40 distinct types based upon characteristic
symptoms, types of seizure, cause, age of onset and EEG
patterns (Commission on Classification and Terminology of
the International League Against Epilepsy, 1989). However,
the single feature that is common to all syndromes is the
persistent increase in neuronal excitability that is both
occasionally and unpredictably expressed as a seizure.

A genetic contribution to the aetiology of epilepsy 
has been estimated to be present in approximately 40% of
affected individuals (Gardiner, 2000). As epileptic
seizures may be the end-point of a number of molecular
aberrations that ultimately disturb neuronal synchrony,
the genetic basis for epilepsy is likely to be
heterogeneous. There are over 200 Mendelian diseases which
include epilepsy as part of the phenotype. In these
diseases, seizures are symptomatic of underlying
neurological involvement such as disturbances in brain
structure or function. In contrast, there are also a
number of "pure" epilepsy syndromes in which epilepsy is
the sole manifestation in the affected individuals. These 
are termed idiopathic and account for over 60% of all 
epilepsy cases. 

Idiopathic epilepsies have been further divided into
partial and generalized sub-types. Partial (focal or
local) epileptic fits arise from localized cortical
discharges, so that only certain groups of muscles are
involved and consciousness may be retained. However, in
generalized epilepsy, EEG discharge shows no focus such
that all subcortical regions of the brain are involved.
Although the observation that generalized epilepsies are
frequently inherited is understandable, the mechanism by
which genetic defects, presumably expressed constitutively 
in the brain, give rise to partial seizures is less clear. 

The molecular genetic era has resulted in spectacular
advances in classification, diagnosis and biological
understanding of numerous inherited neurological disorders
including muscular dystrophies, familial neuropathies and
spinocerebellar degenerations. These disorders are all
uncommon or rare and have simple Mendelian inheritance.
In contrast, common neurological diseases like epilepsy
have complex inheritance where they are determined by
multiple genes sometimes interacting with environmental
influences. Molecular genetic advances in disorders with
complex inheritance have been far more modest to date
(Todd, 1999). 

Most of the molecular genetic advances have occurred
by a sequential three-stage process. First, a clinically
homogeneous disorder is identified and its mode of 
inheritance determined. Second, linkage analysis is 
performed on carefully characterized clinical populations 
with the disorder. Linkage analysis is a process where the 
chromosomal localization of a particular disorder is 
narrowed down to approximately 0.5% of the total genome. 
Knowledge of linkage imparts no intrinsic biological 
insights other than the important clue as to where to look 
in the genome for the abnormal gene. Third, strategies 
such as positional cloning or the positional candidate 
approach are used to identify the aberrant gene and its 
specific mutations within the linked region (Collins,
1995).
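
To give a sense of scale for the linkage step, the following minimal sketch converts a linked interval of 0.5% of the genome into base pairs. The haploid genome size of roughly 3.2 billion base pairs is an assumption that is not stated in the text, and the function name is hypothetical.

```python
# Assumed haploid human genome size in base pairs (not given in the text).
GENOME_BP = 3_200_000_000
# "approximately 0.5% of the total genome"
LINKED_FRACTION = 0.005


def linked_region_bp(genome_bp, fraction):
    """Return the size, in base pairs, of a linked chromosomal interval."""
    return genome_bp * fraction


region_bp = linked_region_bp(GENOME_BP, LINKED_FRACTION)
print(f"Linked interval: about {region_bp / 1e6:.0f} Mb")
```

Under this assumption the linked interval still spans on the order of sixteen megabases, which is why a further gene-identification stage is needed after linkage.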

Linkage studies in disorders with complex inheritance 
have been bedevilled by negative results and by failure to 
replicate positive findings. A sense of frustration 
permeates current literature in the genetics of complex
disorders. Carefully performed, large-scale studies
involving hundreds of sibpairs in disorders including
multiple sclerosis and diabetes have been essentially
negative (Bell and Lathrop, 1996; Lernmark and Ott, 1998).
An emerging view is that such disorders are due to the
summation of many genes of small effect and that
identification of these genes may only be possible with 
very large-scale association studies. Such studies on a 
genome-wide basis are currently impossible due to 
incomplete marker sets and computational limitations. 

The idiopathic generalized epilepsies (IGE) are the
most common group of inherited human epilepsy and do not
have simple inheritance. Like other complex disorders,
linkage studies in IGE have generated controversial and
conflicting claims. Previous authors have suggested the
possibility of multifactorial, polygenic, oligogenic or
two locus models for the disease (Andermann, 1982; Doose
and Baier, 1989; Greenberg et al., 1988a; 1992; Janz et
al., 1992). 

Two broad groups of IGE are now known - the classical 
idiopathic generalized epilepsies (Commission on
Classification and Terminology of the International League
Against Epilepsy, 1989) and the newly recognized genetic
syndrome of generalized epilepsy with febrile seizures
plus (GEFS+) (Scheffer and Berkovic, 1997; Singh et al.,
1999).

The classical IGEs are divided into a number of
clinically recognizable but overlapping sub-syndromes
including childhood absence epilepsy, juvenile absence
epilepsy, juvenile myoclonic epilepsy etc. (Commission on
Classification and Terminology of the International League
Against Epilepsy, 1989; Roger et al., 1992). The
sub-syndromes are identified by age of onset and the pattern
of seizure types (absence, myoclonus and tonic-clonic).
Some patients, particularly those with tonic-clonic
seizures alone, do not fit a specifically recognized
sub-syndrome. Arguments for regarding these as separate
syndromes, yet recognizing that they are part of a
neurobiological continuum, have been presented previously
(Berkovic et al., 1987; 1994; Reutens and Berkovic, 1995).

GEFS+ was originally recognized through large
multi-generation families and comprises a variety of
sub-syndromes. Febrile seizures plus (FS+) is a sub-syndrome
where children have febrile seizures occurring outside the
age range of 3 months to 6 years, or have associated
afebrile tonic-clonic seizures. Many family members have a
phenotype indistinguishable from the classical febrile
convulsion syndrome and some have FS+ with additional
absence, myoclonic, atonic, or complex partial seizures.
The severe end of the GEFS+ spectrum includes
myoclonic-astatic epilepsy.

The cumulative incidence for epilepsy by age 30 years
(proportion suffering from epilepsy at some time) is 1.5%
(Hauser et al., 1993). Accurate estimates for the
cumulative incidence of the IGEs are unavailable. In
epidemiological studies where attempts are made to
subclassify epilepsies, rather few cases of IGE are
diagnosed, and many cases are unclassified. This is
probably because cases are rarely directly examined by
epileptologists. In clinic- or office-based series seen by 
experts, most cases are classifiable and IGEs account for 
about 25% of cases. This suggests that about 0.3% of the 
population suffer from IGE at some time. 
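
The estimate above can be reproduced with a short calculation. This is only a sketch of the arithmetic implied by the two quoted figures (1.5% cumulative incidence and a 25% IGE share); the function and variable names are hypothetical.

```python
# Figures quoted in the text.
CUMULATIVE_INCIDENCE_EPILEPSY = 0.015  # 1.5% by age 30 (Hauser et al., 1993)
IGE_SHARE_OF_CASES = 0.25              # about 25% of cases in expert series


def ige_cumulative_incidence(epilepsy_incidence, ige_share):
    """Estimate the proportion of the population with IGE at some time."""
    return epilepsy_incidence * ige_share


ige_estimate = ige_cumulative_incidence(
    CUMULATIVE_INCIDENCE_EPILEPSY, IGE_SHARE_OF_CASES
)
print(f"Estimated cumulative incidence of IGE: {ige_estimate * 100:.3f}%")
```

The product is a little under 0.4%, which is of the same order as the figure of about 0.3% given in the text.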

In outbred populations many patients with classical
IGE appear to be sporadic as siblings and parents are
usually unaffected. Systematic EEG studies on clinically
unaffected family members show an increase in
age-dependent occurrence of generalized epileptiform
discharges compared to controls. In addition to the
approximately 0.3% of the population with clinical IGE,
systematic EEG studies suggest that about 1% of healthy
children have generalized epileptiform discharges while
awake (Cavazzuti et al., 1980; Okubo et al., 1994).

Approximately 5-10% of first degree relatives of
classical IGE probands have seizures, with affected
relatives usually having IGE phenotypes or febrile
seizures. While nuclear families with 2-4 affected
individuals are well recognized and 3-generation families
or grandparent-grandchild pairs are occasionally observed
(Italian League Against Epilepsy Genetic Collaborative
Group, 1993), families with multiple affected individuals
extending over 4 or more generations are exceptionally 
rare. 

For GEFS+, however, a number of large multi-generation
families showing autosomal dominant inheritance with
incomplete penetrance are known. Similar to classical IGE,
analysis of sporadic cases and small families with GEFS+
phenotypes does not suggest simple Mendelian inheritance.
Indeed, bilineal inheritance, where there is a history of
epilepsy on maternal and paternal sides, is observed in
both GEFS+ and classical IGE families (Singh et al., 1999;
Italian League Against Epilepsy Genetic Collaborative
Group, 1993).

Within single families with classical IGE or GEFS+,
affected individuals often have different sub-syndromes.
The closer an affected relative is to the proband, the
more similar are their sub-syndromes, and siblings often
have similar sub-syndromes (Italian League Against
Epilepsy Genetic Collaborative Group, 1993). Less
commonly, families are observed where most, or all, known
affected individuals have one classical IGE sub-syndrome
such as childhood absence epilepsy or juvenile myoclonic
epilepsy (Italian League Against Epilepsy Genetic
Collaborative Group, 1993).

Importantly, sub-syndromes are identical in affected
monozygous twins with IGE. In contrast, affected dizygous
twins may have the same or different sub-syndromes.
Classical IGE and GEFS+ sub-syndromes tend to segregate
separately (Singh et al., 1999). 

In some inbred communities, pedigree analysis
strongly suggests recessive inheritance for juvenile
myoclonic epilepsy and other forms of IGE (Panayiotopoulos
and Obeid, 1989; Berkovic et al., 2000). In such families,
sub-syndromes are much more similar in affected siblings
than in affected sib-pairs from outbred families. 

Recently, a family with an infantile form of IGE with
autosomal recessive inheritance, confirmed by linkage 
analysis, was described in Italy (Zara et al., 2000). 

Most work on the molecular genetics of classical IGEs
has been done on the sub-syndrome of juvenile myoclonic
epilepsy where a locus in proximity to or within the HLA
region on chromosome 6p was first reported in 1988
(Greenberg et al., 1988b). This finding was supported by
two collaborating laboratories, in separate patient
samples, and subsequently three groups provided further
evidence for a 6p locus for juvenile myoclonic epilepsy in
some, but not all, of their families. However, genetic
defects have not been found and the exact locus of the
gene or genes, in relationship to the HLA region, remains
controversial. Strong evidence for linkage to chromosome 6
also comes from a study of a single large family with
juvenile myoclonic epilepsy, but in this pedigree the
locus is well outside the HLA region. A locus on
chromosome 15q has also been suggested for juvenile
myoclonic epilepsy, but was not confirmed by two other
studies.

In general, the results of studies of the putative
chromosomal 6p locus in the HLA region in patients with
absence epilepsies or other forms of idiopathic
generalized epilepsies have been negative. The major
exception is that study of probands with tonic-clonic
seizures on awakening, a sub-syndrome closely related to
juvenile myoclonic epilepsy, suggests linkage to 6p.

Linkage for classical remitting childhood absence
epilepsy remains elusive, but in a family with persisting
absence evolving into a juvenile myoclonic epilepsy
phenotype, linkage to chromosome 1p has been claimed. An
Indian pedigree with persisting absence and tonic-clonic
seizures may link to 8q24. Linkage to this region was also
suggested by a non-parametric analysis in IGE,
irrespective of sub-syndrome, but was not confirmed in
another study. Other loci for IGEs that have been reported
in single studies include 3p14, 8p, 18 and possibly 5p.
The unusual example of recessively inherited infantile 
onset IGE described in Italy maps to 16p in a single 
family. 

Thus, like most disorders with complex inheritance,
the literature on genetics of classical IGEs is confusing
and contradictory. Some, and perhaps much, of this
confusion is due to heterogeneity, with the likelihood of
a number of loci for IGEs. The studies reviewed above were
principally performed on multiple small families, so
heterogeneity within and between samples is probable.
Whether all, some, or none of the linkages reported above
will be found to harbour relevant genes for IGE remains to
be determined. Most of the studies reviewed above used
analysis methods assuming Mendelian inheritance, an
assumption that is not correct for outbred communities.
Some studies used multiple models (autosomal recessive,
autosomal dominant). Although parametric linkage analysis
may be reliable in some circumstances when analyzing
complex disease, it can lead to spurious findings as
highlighted by the literature on linkage in major
psychoses (Risch and Botstein, 1996).

In so far as GEFS+ is concerned, linkage analysis on
rare large multi-generation families with clinical
evidence of a major autosomal dominant gene has
demonstrated loci on chromosomes 19q and 2q. Both the 19q
and 2q GEFS+ loci have been confirmed in independently
ascertained large families, and genetic defects have been
identified. Families linked to 19q are known and a
mutation in the gene for the β1 subunit of the neuronal
sodium channel (SCN1B) has been identified (Wallace et
al., 1998). This mutation results in the loss of a
critical disulphide bridge of this regulatory subunit and
causes a loss of function in vitro. Families linked to 2q
are also known and mutations in the pore-forming α subunit
of the neuronal sodium channel (SCN1A) have been
identified (PCT/AU01/01648; Wallace et al., 2001b; Escayg
et al., 2000). Studies on the more common small families
with GEFS+ have not revealed these or other mutations to
date.

In addition to the SCN1B and SCN1A mutations in GEFS+,
four other gene defects have been discovered for human
idiopathic epilepsies through the study of large families.
Mutations in the alpha-4 subunit of the neuronal nicotinic
acetylcholine receptor (CHRNA4) occur in the focal
epilepsy syndrome of autosomal dominant nocturnal frontal
lobe epilepsy (Australian patent AU-B-56247/96; Steinlein
et al., 1995). Mutations in the gamma-2 subunit of the
GABA-A receptor (GABRG2) have been identified in childhood
absence epilepsy, febrile seizures (including febrile
seizures plus) and myoclonic epilepsy (PCT/AU01/00729;
Wallace et al., 2001a). Finally, mutations in two
potassium channel genes (KCNQ2 and KCNQ3) were identified
in benign familial neonatal convulsions (Singh et al.,
1998; Biervert et al., 1998; Charlier et al., 1998).
Although initially regarded as a special form of IGE, this 
unusual syndrome is probably a form of inherited focal 
epilepsy. 

Further to these studies, mutations in other genes
have been identified to be causative of epilepsy. These
include mutations in the beta-2 subunit (CHRNB2) of the
neuronal nicotinic acetylcholine receptor (PCT/AU01/00541;
Phillips et al., 2001) and the delta subunit (GABRD) of
the GABA-A receptor (PCT/AU01/00729).

A number of mouse models approximating human IGE are
known. These mouse mutants have ataxia in addition to
generalized spike-and-wave discharges with absences or
tonic-clonic seizures. Recessive mutations in calcium
channel subunit genes have been found in lethargic
(CACNB4), tottering/leaner (CACNA1A), and stargazer
(CACNG2) mutants. The slow-wave epilepsy mouse mutant has
a mutation in the sodium/hydrogen exchanger gene, which
may have important downstream effects on pH-sensitive ion
channels.

The human and mouse literature is now suggesting that
the idiopathic epilepsies comprise a family of
channelopathies with mutations in ion channel subunits of
voltage-gated (eg SCN1A, SCN1B, KCNQ2, KCNQ3) or
ligand-gated (eg CHRNA4, CHRNB2, GABRG2, GABRD) types. These
channels are typically comprised of a number of subunits,
specified by genes on different chromosomes. The 
stoichiometry and conformation of ion channel subunits are 
not yet well understood, but many have multiple subunits 
in a variety of combinations. 

The involvement of ion channels in other
neuro/physiological disorders has also been observed
(reviewed in Dworakowska and Dolowy, 2000). Mutations in
voltage-gated sodium, potassium, calcium and chloride
channels as well as ligand-gated channels such as the
acetylcholine and GABA receptors may lead to physiological
disorders such as hyper- and hypo-kalemic periodic
paralysis, myotonias, malignant hyperthermia, myasthenia
and cardiac arrhythmias. Neurological disorders other than
epilepsy that are associated with ion channel mutations
include episodic ataxia, migraine, Alzheimer's disease,
Parkinson's disease, schizophrenia, hyperekplexia,
anxiety, depression, phobic obsessive symptoms, as well as
neuropathic pain, inflammatory pain and chronic/acute
pain. Some kidney disorders such as Bartter's syndrome,
polycystic kidney disease and Dent's disease, secretion
disorders such as hyperinsulinemic hypoglycemia of infancy
and cystic fibrosis, and vision disorders such as
congenital stationary night blindness and total
colour-blindness may also be linked to mutations in ion
channels.

Disclosure of the Invention

In a new genetic model for the idiopathic generalised
epilepsies (IGEs) described in PCT/AU01/00872 (the
disclosure of which is incorporated herein by reference)
it has been postulated that most classical IGE and GEFS+
cases are due to the combination of two mutations in
multi-subunit ion channels. These are typically point
mutations resulting in a subtle change of function. The
critical postulate is that two mutations, usually, but not
exclusively, in different subunit alleles ("digenic
model"), are required for clinical expression of IGE. It
was further proposed that:

a) A number of different mutated subunit pairs can
   be responsible for IGE. Combinations of two
   mutated subunits lead to an IGE genotype with
   ~30% penetrance.

b) The total allele frequency of mutated subunits
   is ~8%. It was calculated that approximately 15%
   of the population has one or more mutated
   subunit genes and 1% have two or more mutated
   subunits.

c) Sub-syndromes are principally determined by the
   specific combination of mutated subunit pairs,
   although one or more other genes, including ion
   channel subunits, of smaller effect may modify
   the phenotype.

d) Mutated subunit combinations that cause
   classical IGEs are largely separate from those
   that cause GEFS+, although some subunits may be
   involved in both syndromes.

e) Individuals with single "change of function"
   mutations would not have IGE, but such mutations
   may contribute to simple febrile seizures, which
   are observed with increased frequency in
   relatives of IGE probands.
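
The population figures in item b) can be approximated with a simple calculation. The text does not say how its figures were derived, so the sketch below rests on stated assumptions: a total mutated-allele frequency of 8% (a value that reproduces the 15% carrier figure), mutated alleles spread independently over many loci so that the number carried per person is roughly Poisson-distributed, and the roughly 30% penetrance from item a). All names are hypothetical.

```python
import math

# Assumed combined frequency of mutated subunit alleles per chromosome.
TOTAL_ALLELE_FREQUENCY = 0.08
# Approximate penetrance of a genotype with two mutated subunits (item a).
PENETRANCE = 0.30


def carrier_probabilities(allele_frequency):
    """Return (P(one or more), P(two or more)) mutated alleles per person.

    Assumption: the count of mutated alleles in a diploid individual is
    approximately Poisson with mean 2 * allele_frequency.
    """
    mean = 2 * allele_frequency
    p_none = math.exp(-mean)
    p_exactly_one = mean * p_none
    return 1 - p_none, 1 - p_none - p_exactly_one


p_one_or_more, p_two_or_more = carrier_probabilities(TOTAL_ALLELE_FREQUENCY)
expected_ige = p_two_or_more * PENETRANCE

print(f"One or more mutated subunits: {p_one_or_more * 100:.1f}%")
print(f"Two or more mutated subunits: {p_two_or_more * 100:.1f}%")
print(f"Expected IGE prevalence:      {expected_ige * 100:.2f}%")
```

Under these assumptions roughly 15% of people carry at least one mutated subunit and about 1% carry two or more; applying 30% penetrance to the latter group gives a prevalence in the region of 0.3%.
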

The model also proposes that subunit mutations with
more severe functional consequences (eg breaking a
disulphide bridge in SCN1B or amino acid substitution in
the pore-forming regions of SCN1A for GEFS+) cause
autosomal dominant generalized epilepsies with a
penetrance of 60-90%. The precise sub-syndromes in GEFS+
are determined by minor allelic variation or mutations in
other ion channel subunits. Such "severe" mutations are
rare (allele frequency <0.01%) and are infrequent causes
of GEFS+. They very rarely, or perhaps never, cause
classical IGE.

The identification of molecular changes in ion 
channel subunits is therefore a significant step towards
the elucidation of genetic variants that alone or in
combination (based on the digenic model) give rise to an
epilepsy phenotype, and to other neuro/physiological
disorders associated with ion channel dysfunction. 

The present inventors have identified a number of
novel mutations or variants in genes encoding subunits of
ion channels in individuals with epilepsy. It will be
appreciated that for each molecular defect one can provide
an isolated nucleic acid molecule coding for a protein
having a biological function as part of an ion channel in
a mammal, wherein a mutation event selected from the group
consisting of point mutations, deletions, insertions and
rearrangements has occurred so as to affect the
functioning of the ion channel. In some instances this 
single mutation alone will produce a phenotype of epilepsy 
or other neuro/physiological disorders associated with ion 
channel dysfunction. 

In the case where a single mutation alone does not
produce, say, an epilepsy phenotype, there would be
provided one or more additional isolated nucleic acid
molecules coding for proteins having a biological function
as part of an ion channel in a mammal, wherein a mutation
event selected from the group consisting of point
mutations, deletions, insertions and rearrangements has
occurred so as to affect the functioning of the ion
channel. The cumulative effect of the mutations in each
isolated nucleic acid molecule in vivo is to produce
epilepsy or another neuro/physiological disorder in said
mammal. The mutations may be in nucleic acid molecules 
coding for protein subunits belonging to the same ion 
channel or may be in nucleic acid molecules coding for 
protein subunits that belong to different ion channels. 

Typically such mutations are point mutations and the
ion channels are voltage-gated channels such as sodium,
potassium, calcium or chloride channels or are
ligand-gated channels such as members of the nAChR/GABA
superfamily of receptors, or a functional fragment or
homologue thereof.

Mutations may include those in non-coding regions of 
the ion channel subunits (eg mutations in the promoter 
region which affect the level of expression of the subunit 
gene, mutations in intronic sequences which affect the 
correct splicing of the subunit during mRNA processing, or
mutations in the 5' or 3' untranslated regions that can
affect translation or stability of the mRNA). Mutations
may also, and more preferably will, be in coding regions of
the ion channel subunits (eg nucleotide mutations may give
rise to an amino acid change in the encoded protein or
nucleotide mutations that do not give rise to an amino
acid change but may affect the stability of the mRNA).

Mutation combinations may be selected from, but are 
not restricted to, those identified in Table 1. 

Accordingly, in one aspect of the present invention
there is provided a method of identifying a subject
predisposed to a disorder associated with ion channel
dysfunction, comprising ascertaining whether at least one
of the genes encoding ion channel subunits in said subject
has undergone a mutation event as set forth in one of SEQ
ID Numbers: 1-62.

In another aspect of the present invention there is
provided an isolated nucleic acid molecule encoding a
mutant or variant ion channel subunit wherein a mutation
event has occurred as set forth in one of SEQ ID Numbers:
1-62.

The mutation event disrupts the functioning of an ion
channel so as to produce a phenotype of epilepsy, and/or
one or more other disorders associated with ion channel
dysfunction, including but not restricted to, hyper- or
hypo-kalemic periodic paralysis, myotonias, malignant
hyperthermia, myasthenia, cardiac arrhythmias, episodic
ataxia, migraine, Alzheimer's disease, Parkinson's
disease, schizophrenia, hyperekplexia, anxiety,
depression, phobic obsessive symptoms, neuropathic pain,
inflammatory pain, chronic/acute pain, Bartter's syndrome,
polycystic kidney disease, Dent's disease,
hyperinsulinemic hypoglycemia of infancy, cystic fibrosis,
congenital stationary night blindness and total
colour-blindness, either alone or in combination with one
or more additional mutations or variations in the ion
channel subunit genes.

In another aspect of the present invention there is 
provided an isolated nucleic acid molecule encoding a 
mutant KCNQ2 subunit, wherein the mutation event has
occurred in the C-terminal domain of the KCNQ2 subunit and 
leads to a disturbance in the calmodulin binding affinity 
of the subunit, so as to produce an epilepsy phenotype. 

In one form of the invention, the mutations are in 
exon 8 or exon 15 of the KCNQ2 subunit and result in the 
replacement of an arginine residue with a glycine residue 
at amino acid position 353, or the replacement of a
leucine residue with an arginine at amino acid position 
619. The R353G mutation occurs as a result of a C to G 
nucleotide substitution at position 1057 of the KCNQ2 
coding sequence as shown in SEQ ID NO: 34. The L619R 
mutation occurs as a result of a T to G nucleotide 
substitution at position 1856 of the KCNQ2 coding sequence 
as shown in SEQ ID NO: 37. 

In a further form of the invention, the mutations are 
in exon 11 or exon 14 of the KCNQ2 subunit and result in 
the replacement of an arginine residue with a stop codon 
at amino acid position 430, or the replacement of an 
arginine residue with a serine at amino acid position 570.
The R430X mutation occurs as a result of a C to T 
nucleotide substitution at position 1288 of the KCNQ2 
coding sequence as shown in SEQ ID NO: 35. The R570S 
mutation occurs as a result of an A to T nucleotide 
substitution at position 1710 of the KCNQ2 coding sequence 
as shown in SEQ ID NO: 36. 
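
The relationship between the nucleotide positions and amino acid positions quoted for the four KCNQ2 mutations can be checked with a short sketch. It assumes that coding-sequence numbering starts at 1 on the first base of the initiation codon, which the text does not state explicitly; the function name is hypothetical.

```python
def codon_location(cds_position):
    """Map a 1-based coding-sequence position to (codon number, base in codon)."""
    if cds_position < 1:
        raise ValueError("coding-sequence positions are 1-based")
    codon_number = (cds_position - 1) // 3 + 1
    base_in_codon = (cds_position - 1) % 3 + 1
    return codon_number, base_in_codon


# Nucleotide positions quoted in the text for each KCNQ2 mutation.
KCNQ2_MUTATIONS = {
    "R353G": 1057,  # C to G
    "L619R": 1856,  # T to G
    "R430X": 1288,  # C to T
    "R570S": 1710,  # A to T
}

for name, position in KCNQ2_MUTATIONS.items():
    codon_number, base_in_codon = codon_location(position)
    print(f"{name}: nucleotide {position} is base {base_in_codon} "
          f"of codon {codon_number}")
```

Under that numbering assumption, each quoted nucleotide position falls within the codon whose number matches the stated amino acid position.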

Typically these mutations create a phenotype of 
benign familial neonatal seizures (BFNS).

In a further aspect of the present invention there is 
provided a combination of two or more isolated nucleic 
acid molecules each having a novel mutation event as laid 
out in Table 1. The cumulative effect of the mutations in 
each isolated nucleic acid molecule in vivo is to produce 
an epilepsy or another disorder associated with ion 
channel dysfunction as described above in said mammal. 

In a particularly preferred embodiment of the present 
invention, the isolated nucleic acid molecules have a 
nucleotide sequence as shown in any one of SEQ ID Numbers:
1-62. The sequences correspond to the novel DNA mutations
or variants laid out in Table 1. 

In another aspect of the present invention there is 
provided an isolated nucleic acid molecule comprising any
one of the nucleotide sequences set forth in SEQ ID 
Numbers: 1-62. 

In another aspect of the present invention there is
provided an isolated nucleic acid molecule consisting of
any one of the nucleotide sequences set forth in SEQ ID
Numbers: 1-62. 

The nucleotide sequences of the present invention can
be engineered using methods accepted in the art for a
variety of purposes. These include, but are not limited
to, modification of the cloning, processing, and/or
expression of the gene product. PCR reassembly of gene
fragments and the use of synthetic oligonucleotides allow
the engineering of the nucleotide sequences of the present
invention. For example, oligonucleotide-mediated
site-directed mutagenesis can introduce further mutations
that create new restriction sites, alter expression
patterns and produce splice variants etc.

As a result of the degeneracy of the genetic code, a
number of polynucleotide sequences, some that may have
minimal similarity to the polynucleotide sequences of any
known and naturally occurring gene, may be produced. Thus,
the invention includes each and every possible variation
of a polynucleotide sequence that could be made by
selecting combinations based on possible codon choices.
These combinations are made in accordance with the
standard triplet genetic code as applied to the 
polynucleotide sequences of the present invention, and all 
such variations are to be considered as being specifically 
disclosed. 
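
The effect of codon degeneracy can be illustrated by enumerating every coding sequence for a short peptide under the standard triplet genetic code. This is a minimal sketch; the three-residue peptide "MRG" is an arbitrary example and is not taken from the sequence listing, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Reverse lookup: amino acid -> list of codons that encode it.
SYNONYMS = {}
for codon, amino_acid in CODON_TABLE.items():
    SYNONYMS.setdefault(amino_acid, []).append(codon)


def translate(dna):
    """Translate a DNA coding sequence read in frame from its first base."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def synonymous_sequences(peptide):
    """Return every DNA sequence that encodes the given peptide."""
    choices = [SYNONYMS[amino_acid] for amino_acid in peptide]
    return ["".join(combo) for combo in product(*choices)]


variants = synonymous_sequences("MRG")
print(len(variants), "distinct sequences encode MRG")
```

Even a three-residue peptide has many distinct encodings, and the number grows multiplicatively with peptide length.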

The nucleic acid molecules of this invention are
typically DNA molecules, and include cDNA, genomic DNA,
synthetic forms, and mixed polymers, both sense and
antisense strands, and may be chemically or biochemically
modified, or may contain non-natural or derivatised
nucleotide bases as will be appreciated by those skilled
in the art. Such modifications include labels,
methylation, intercalators, alkylators and modified
linkages. In some instances it may be advantageous to
produce nucleotide sequences possessing a substantially
different codon usage than that of the polynucleotide
sequences of the present invention. For example, codons
may be selected to increase the rate of expression of the
peptide in a particular prokaryotic or eukaryotic host
corresponding with the frequency that particular codons
are utilized by the host. Other reasons to alter the
nucleotide sequence without altering the encoded amino
acid sequences include the production of RNA transcripts
having more desirable properties, such as a greater
half-life, than transcripts produced from the naturally
occurring mutated sequence. 
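
Selecting codons according to host usage can be sketched as follows. The usage table is hypothetical, with illustrative numbers for an imaginary host rather than measured data, and the peptide and function name are likewise placeholders.

```python
# Hypothetical relative codon usage for an imaginary host organism.
# These numbers are illustrative only and are not measured data.
HOST_CODON_USAGE = {
    "M": {"ATG": 1.00},
    "R": {"CGT": 0.38, "CGC": 0.36, "CGA": 0.07,
          "CGG": 0.10, "AGA": 0.06, "AGG": 0.03},
    "G": {"GGT": 0.34, "GGC": 0.40, "GGA": 0.11, "GGG": 0.15},
}


def preferred_codons(peptide, usage):
    """Encode a peptide using the host's most frequent codon per residue."""
    return "".join(
        max(usage[amino_acid], key=usage[amino_acid].get)
        for amino_acid in peptide
    )


optimised = preferred_codons("MRG", HOST_CODON_USAGE)
print(optimised)
```

The encoded amino acid sequence is unchanged; only the choice among synonymous codons differs.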

The invention also encompasses production of nucleic
acid sequences of the present invention entirely by
synthetic chemistry. Synthetic sequences may be inserted
into expression vectors and cell systems that contain the
necessary elements for transcriptional and translational
control of the inserted coding sequence in a suitable
host. These elements may include regulatory sequences,
promoters, 5' and 3' untranslated regions and specific
initiation signals (such as an ATG initiation codon and
Kozak consensus sequence) which allow more efficient
translation of sequences encoding the polypeptides of the
present invention. In cases where the complete coding
sequence, including the initiation codon and upstream
regulatory sequences, is inserted into the appropriate
expression vector, additional control signals may not be
needed. However, in cases where only coding sequence, or a
fragment thereof, is inserted, exogenous translational
control signals as described above should be provided by
the vector. Such signals may be of various origins, both
natural and synthetic. The efficiency of expression may be
enhanced by the inclusion of enhancers appropriate for the
particular host cell system used (Scharf et al., 1994).
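
The role of the initiation signals mentioned above can be
sketched with a simple check of the context around an ATG codon.
This is an illustrative sketch only: it assumes the commonly
cited Kozak rule of thumb (a purine at position -3 and a G at
position +4, counting the A of ATG as +1), and the function name
and the three-level grading are choices made for the example.

```python
def kozak_context(seq, atg_index):
    """Grade the context of the ATG starting at atg_index.

    Returns "strong" when both a purine at -3 and a G at +4 are
    present, "adequate" when only one is, and "weak" otherwise.
    """
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    if atg_index < 3 or atg_index + 3 >= len(seq):
        raise ValueError("not enough flanking sequence to grade the context")
    purine_at_minus3 = seq[atg_index - 3] in "AG"
    g_at_plus4 = seq[atg_index + 3] == "G"
    score = int(purine_at_minus3) + int(g_at_plus4)
    return ("weak", "adequate", "strong")[score]


# Toy sequences invented for the example.
print(kozak_context("GCCACCATGGCG", 6))
print(kozak_context("TTTTTTATGTTT", 6))
```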

The invention also includes nucleic acid molecules 
that are the complements of the sequences described
herein. 
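
For a DNA strand, the complement read in the conventional 5' to
3' direction is the reverse complement. A minimal sketch follows;
the input is a toy sequence invented for the example, not a
sequence of the invention.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGC"))
```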

The present invention allows for the preparation of
purified polypeptide or protein from the polynucleotides
of the present invention, or variants thereof. In order to
do this, host cells may be transformed with a novel
nucleic acid molecule as described above, or with nucleic
acid molecules encoding two or more mutant ion channel
subunits. If the mutant subunits form a part of the same
ion channel, a receptor protein containing two or more
mutant subunits may be isolated. If the mutant subunits
are subunits of different ion channels, the host cells will
express two or more mutant receptor proteins. Typically
said host cells are transfected with an expression vector
comprising a DNA molecule according to the invention or,
in particular, DNA molecules encoding two or more mutant
ion channel subunits. A variety of expression vector/host
systems may be utilized to contain and express sequences
encoding polypeptides of the invention. These include, but
are not limited to, microorganisms such as bacteria
transformed with plasmid or cosmid DNA expression vectors;
yeast transformed with yeast expression vectors; insect
cell systems infected with viral expression vectors (e.g.,
baculovirus); or mouse or other animal or human tissue
cell systems. Mammalian cells can also be used to express
a protein using a vaccinia virus expression system. The
invention is not limited by the host cell or vector
employed.

The polynucleotide sequences, or variants thereof, of
the present invention can be stably expressed in cell
lines to allow long term production of recombinant
proteins in mammalian systems. Sequences encoding the
polypeptides of the present invention can be transformed
into cell lines using expression vectors which may contain
viral origins of replication and/or endogenous expression
elements and a selectable marker gene on the same or on a
separate vector. The selectable marker confers resistance
to a selective agent, and its presence allows growth and
recovery of cells which successfully express the
introduced sequences. Resistant clones of stably
transformed cells may be propagated using tissue culture
techniques appropriate to the cell type.

The protein produced by a transformed cell may be
secreted or retained intracellularly depending on the
sequence and/or the vector used. As will be understood by
those of skill in the art, expression vectors containing
polynucleotides which encode a protein may be designed to
contain signal sequences which direct secretion of the
protein through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its
ability to modulate expression of the inserted sequences
or to process the expressed protein in the desired
fashion. Such modifications of the polypeptide include,
but are not limited to, acetylation, glycosylation,
phosphorylation, and acylation. Post-translational
cleavage of a "prepro" form of the protein may also be
used to specify protein targeting, folding, and/or
activity. Different host cells having specific cellular
machinery and characteristic mechanisms for post-
translational activities (e.g., CHO or HeLa cells) are
available from the American Type Culture Collection (ATCC)
and may be chosen to ensure the correct modification and
processing of the foreign protein.

When large quantities of the protein product of the
gene are needed, such as for antibody production, vectors
which direct high levels of expression of this protein may
be used, such as those containing the T5 or T7 inducible
bacteriophage promoter. The present invention also
includes the use of the expression systems described above
in generating and isolating fusion proteins which contain
important functional domains of the protein. These fusion
proteins are used for binding, structural and functional
studies as well as for the generation of appropriate
antibodies.

In order to express and purify the protein as a
fusion protein, the appropriate cDNA sequence is inserted
into a vector which contains a nucleotide sequence
encoding another peptide (for example, glutathione
S-transferase). The fusion protein is expressed and
recovered from prokaryotic or eukaryotic cells. The fusion
protein can then be purified by affinity chromatography
based upon the fusion vector sequence. The desired protein
is then obtained by enzymatic cleavage of the fusion
protein.

Fragments of the polypeptides of the present
invention may also be produced by direct peptide synthesis
using solid-phase techniques. Automated synthesis may be
achieved by using the ABI 431A Peptide Synthesiser
(Perkin-Elmer). Various fragments of this protein may be
synthesized separately and then combined to produce the
full-length molecule.

The present invention is also concerned with
polypeptides having a biological function as an ion
channel in a mammal, wherein a mutation event selected
from the group consisting of substitutions, deletions,
truncations, insertions and rearrangements has occurred so
as to affect the functioning of the ion channel. In some
instances this single mutation alone will produce an
epilepsy phenotype or other neuro/physiological disorders
associated with ion channel dysfunction.

In the case where a single mutation alone does not
produce, say, an epilepsy phenotype, there would be
provided one or more additional isolated mammalian
polypeptides having biological functions as part of an ion
channel in a mammal, wherein a mutation event selected
from the group consisting of substitutions, deletions,
truncations, insertions and rearrangements has occurred so
as to affect the functioning of the ion channel, the
cumulative effect of the mutations in each isolated
mammalian polypeptide in vivo being to produce epilepsy or
another neuro/physiological disorder in said mammal. The
mutations may be in polypeptide subunits belonging to the
same ion channel as described above, but may also be in
polypeptide subunits that belong to different ion
channels.

Typically the mutation is an amino acid substitution
and the ion channel is a voltage-gated channel such as a
sodium, potassium, calcium or chloride channel or a
ligand-gated channel such as a member of the nAChR/GABA
superfamily of receptors, or a functional fragment or
homologue thereof.

Mutation combinations may be selected from, but are
not restricted to, those represented in Table 1.

Accordingly, in a further aspect of the present
invention there is provided an isolated polypeptide, said
polypeptide being a mutant or variant ion channel subunit
wherein a mutation event has occurred such that the
polypeptide has the amino acid sequence set forth in one
of SEQ ID Numbers: 63-76. The mutation event disrupts the
functioning of an ion channel so as to produce a phenotype
of epilepsy, and/or one or more other disorders associated
with ion channel dysfunction, including but not restricted
to, hyper- or hypo-kalemic periodic paralysis, myotonias,
malignant hyperthermia, myasthenia, cardiac arrhythmias,
episodic ataxia, migraine, Alzheimer's disease,
Parkinson's disease, schizophrenia, hyperekplexia,
anxiety, depression, phobic obsessive symptoms,
neuropathic pain, inflammatory pain, chronic/acute pain,
Bartter's syndrome, polycystic kidney disease, Dent's
disease, hyperinsulinemic hypoglycemia of infancy, cystic
fibrosis, congenital stationary night blindness and total
colour-blindness.

In a particularly preferred embodiment of the present
invention, the isolated polypeptide has an amino acid
sequence as shown in any one of SEQ ID Numbers: 63-76. The
sequences correspond to the novel amino acid changes laid
out in Table 1 for those instances where the DNA mutation
results in an amino acid change.

According to still another aspect of the present
invention there is provided an isolated polypeptide, said
polypeptide being a mutant KCNQ2 subunit, wherein the
mutation event has occurred in the C-terminal domain of
the KCNQ2 subunit and leads to a disturbance in the
calmodulin binding affinity of the subunit, so as to
produce an epilepsy phenotype.

In one form of the invention the mutations are
substitutions in which an arginine residue is replaced
with a glycine residue, or a leucine residue is replaced
with an arginine. Preferably the substitutions are R353G
and L619R transitions as illustrated by SEQ ID NOS: 73 and
76 respectively.

In a further form of the invention the mutations
result in the replacement of an arginine with a stop codon,
or in the replacement of an arginine with a serine.
Preferably the mutations are R430X and R570S transitions as
illustrated by SEQ ID NOS: 74 and 75 respectively.
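
The mutation shorthand used above (for example R353G or R430X)
gives the wild-type residue, its position, and the replacement,
with X denoting a stop. A minimal sketch of applying such a
change to a protein sequence is given below. The five-residue
sequence is a toy example invented for illustration and is not
KCNQ2 or any sequence of the invention; the function name is
likewise hypothetical.

```python
import re


def apply_mutation(protein, mutation):
    """Apply a change such as "R3G" (substitution) or "R3X" (stop).

    Positions are 1-based. A stop ("X") truncates the protein just
    before the affected position.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognised mutation: {mutation!r}")
    wild_type, position, replacement = (
        match.group(1), int(match.group(2)), match.group(3)
    )
    if not 1 <= position <= len(protein):
        raise ValueError("position lies outside the protein")
    if protein[position - 1] != wild_type:
        raise ValueError("wild-type residue does not match the sequence")
    if replacement == "X":
        return protein[:position - 1]
    return protein[:position - 1] + replacement + protein[position:]


toy_protein = "MKRLS"  # invented sequence, for illustration only
print(apply_mutation(toy_protein, "R3G"))
print(apply_mutation(toy_protein, "R3X"))
```

Checking the stated wild-type residue against the sequence
catches off-by-one errors in position numbering.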

In a still further aspect of the present invention
there is provided a combination of two or more isolated
polypeptides each having a novel mutation event as laid
out in Table 1. The cumulative effect of the mutations in
each isolated polypeptide molecule in vivo is to produce
an epilepsy or another disorder associated with ion
channel dysfunction as described above in said mammal.

In a particularly preferred embodiment of the present
invention, the isolated polypeptides have an amino acid
sequence as shown in any one of SEQ ID Numbers: 63-76. The
sequences correspond to the novel amino acid changes laid
out in Table 1.

According to still another aspect of the present
invention there is provided an isolated polypeptide
comprising the amino acid sequence set forth in any one of
SEQ ID Numbers: 63-76.

According to still another aspect of the present
invention there is provided a polypeptide consisting of
the amino acid sequence set forth in any one of SEQ ID
Numbers: 63-76.

According to still another aspect of the present
invention there is provided a method of preparing a
polypeptide, comprising the steps of:
(1) culturing host cells transfected with an
expression vector comprising a nucleic acid
molecule as described above under conditions
effective for polypeptide production; and
(2) harvesting the mutant ion channel subunit.

The mutant ion channel subunit may be allowed to
assemble with other subunits constituting the channel that
are either wild-type or themselves mutant subunits,
whereby the assembled ion channel is harvested.

According to still another aspect of the invention
there is provided a polypeptide which is the product of
the process described above.

Substantially purified protein or fragments thereof
can then be used in further biochemical analyses to
establish secondary and tertiary structure. Such
methodology is known in the art and includes, but is not
restricted to, X-ray crystallography of crystals of the
proteins or of the assembled ion channel incorporating the
proteins or by nuclear magnetic resonance (NMR).
Determination of structure allows for the rational design
of pharmaceuticals to interact with the ion channel as a
whole or through interaction with a specific subunit
protein (see drug screening below), alter the overall ion
channel protein charge configuration or charge interaction
with other proteins, or to alter its function in the cell.

It will be appreciated that the mutant ion channel
subunits included as part of the present invention will be
useful in further applications which include a variety of
hybridisation and immunological assays to screen for and
detect the presence of either a normal or mutated gene or
gene product. The invention enables therapeutic methods
for the treatment of epilepsy as well as other disorders
associated with ion channel dysfunction and also enables
methods for the diagnosis of epilepsy as well as other
disorders associated with ion channel dysfunction.

Therapeutic Applications 

According to still another aspect of the invention
there is provided a method of treating epilepsy as well as
other disorders associated with ion channel dysfunction,
including but not restricted to, hyper- or hypo-kalemic
periodic paralysis, myotonias, malignant hyperthermia,
myasthenia, cardiac arrhythmias, episodic ataxia,
migraine, Alzheimer's disease, Parkinson's disease,
schizophrenia, hyperekplexia, anxiety, depression, phobic
obsessive symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night
blindness or total colour-blindness, comprising
administering a selective antagonist, agonist or modulator
of an ion channel or ion channel subunit, when the ion
channel contains a mutation in a subunit comprising the
channel, as described above, to a subject in need of such
treatment. Said mutation event may be causative of the
disorder when expressed alone or when expressed in
combination with one or more additional mutations in
subunits of the same or different ion channels, which are
typically those identified in Table 1.

In still another aspect of the invention there is
provided the use of a selective antagonist, agonist or
modulator of an ion channel or ion channel subunit when
the ion channel contains a mutation in a subunit
comprising the channel, as described above, said mutation
being causative of epilepsy as well as other disorders
associated with ion channel dysfunction, including but not
restricted to, hyper- or hypo-kalemic periodic paralysis,
myotonias, malignant hyperthermia, myasthenia, cardiac
arrhythmias, episodic ataxia, migraine, Alzheimer's
disease, Parkinson's disease, schizophrenia,
hyperekplexia, anxiety, depression, phobic obsessive
symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night
blindness or total colour-blindness, when expressed alone
or when expressed in combination with a second mutation in
a subunit of the same or different ion channel, as
described above, in the manufacture of a medicament for
the treatment of the disorder.

In one aspect, a suitable antagonist, agonist or 
modulator will restore wild-type function to the ion 
channel or channels containing the mutations of the 
present invention, or will negate the effects the mutant 
channel or channels have on cell function. 

Using methods well known in the art, a mutant ion 
channel may be used to produce antibodies specific for the 
mutant channel that is causative of the disease or to 
screen libraries of pharmaceutical agents to identify 
those that bind the mutant ion channel. 

In one aspect, an antibody, which specifically binds 
to a mutant ion channel or mutant ion channel subunit of 
the invention, may be used directly as an agonist, 
antagonist or modulator, or indirectly as a targeting or 
delivery mechanism for bringing a pharmaceutical agent to 
cells or tissues that express the mutant ion channel. 

In a still further aspect of the invention there is 
provided an antibody which is immunologically reactive 
with a polypeptide as described above, but not with a 
wild-type ion channel or ion channel subunit thereof. 

In particular, there is provided an antibody to an
assembled ion channel containing a mutation in a subunit
comprising the receptor, which is causative of epilepsy or
another disorder associated with ion channel dysfunction
when expressed alone or when expressed in combination with
one or more other mutations in subunits of the same or
different ion channels. Such antibodies may include, but
are not limited to, polyclonal, monoclonal, chimeric, and
single chain antibodies as would be understood by the
person skilled in the art.

For the production of antibodies, various hosts
including rabbits, rats, goats, mice, humans, and others
may be immunized by injection with a polypeptide as
described above or with any fragment or oligopeptide
thereof which has immunogenic properties. Various
adjuvants may be used to increase immunological response
and include, but are not limited to, Freund's, mineral
gels such as aluminium hydroxide, and surface-active
substances such as lysolecithin. Adjuvants used in humans
include BCG (bacilli Calmette-Guerin) and Corynebacterium
parvum.

It is preferred that the oligopeptides, peptides, or
fragments used to induce antibodies to the mutant ion
channel have an amino acid sequence consisting of at least
5 amino acids, and, more preferably, of at least 10 amino
acids. It is also preferable that these oligopeptides,
peptides, or fragments are identical to a portion of the
amino acid sequence of the natural protein and contain the
entire amino acid sequence of a small, naturally occurring
molecule. Short stretches of ion channel amino acids may
be fused with those of another protein, such as KLH, and
antibodies to the chimeric molecule may be produced.

Monoclonal antibodies to a mutant ion channel may be
prepared using any technique which provides for the
production of antibody molecules by continuous cell lines
in culture. These include, but are not limited to, the
hybridoma technique, the human B-cell hybridoma technique,
and the EBV-hybridoma technique. (For example, see Kohler
et al., 1975; Kozbor et al., 1985; Cote et al., 1983; Cole
et al., 1984).

Monoclonal antibodies produced may include, but are
not limited to, mouse-derived antibodies, humanised
antibodies and fully human antibodies.

Antibodies may also be produced by inducing in vivo
production in the lymphocyte population or by screening
immunoglobulin libraries or panels of highly specific
binding reagents as disclosed in the literature. (For
example, see Orlandi et al., 1989; Winter and Milstein,
1991).

Antibody fragments which contain specific binding
sites for a mutant ion channel may also be generated. For
example, such fragments include F(ab')2 fragments
produced by pepsin digestion of the antibody molecule and
Fab fragments generated by reducing the disulfide bridges
of the F(ab')2 fragments. Alternatively, Fab expression
libraries may be constructed to allow rapid and easy
identification of monoclonal Fab fragments with the
desired specificity. (For example, see Huse et al., 1989).

Various immunoassays may be used for screening to
identify antibodies having the desired specificity.
Numerous protocols for competitive binding or
immunoradiometric assays using either polyclonal or
monoclonal antibodies with established specificities are
well known in the art. Such immunoassays typically involve
the measurement of complex formation between an ion
channel and its specific antibody. A two-site, monoclonal-
based immunoassay utilizing antibodies reactive to two
non-interfering ion channel epitopes is preferred, but a
competitive binding assay may also be employed.

In a further aspect of the invention there is
provided a method of treating epilepsy as well as other
disorders associated with ion channel dysfunction,
including but not restricted to, hyper- or hypo-kalemic
periodic paralysis, myotonias, malignant hyperthermia,
myasthenia, cardiac arrhythmias, episodic ataxia,
migraine, Alzheimer's disease, Parkinson's disease,
schizophrenia, hyperekplexia, anxiety, depression, phobic
obsessive symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night
blindness or total colour-blindness, comprising
administering an isolated nucleic acid molecule which is
the complement (antisense) of any one of the nucleic acid
molecules described above and which encodes an RNA
molecule that hybridizes with the mRNA encoding a mutant
ion channel subunit of the invention, to a subject in need
of such treatment.

In a still further aspect of the invention there is
provided the use of an isolated nucleic acid molecule
which is the complement (antisense) of a nucleic acid
molecule of the invention and which encodes an RNA
molecule that hybridises with the mRNA encoding a mutant
ion channel subunit of the invention, in the manufacture
of a medicament for the treatment of epilepsy as well as
other disorders associated with ion channel dysfunction,
including but not restricted to, hyper- or hypo-kalemic
periodic paralysis, myotonias, malignant hyperthermia,
myasthenia, cardiac arrhythmias, episodic ataxia,
migraine, Alzheimer's disease, Parkinson's disease,
schizophrenia, hyperekplexia, anxiety, depression, phobic
obsessive symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night
blindness or total colour-blindness.

Typically, a vector expressing the complement
(antisense) of the polynucleotides of the invention may be
administered to a subject in need of such treatment. Many
methods for introducing vectors into cells or tissues are
available and equally suitable for use in vivo, in vitro,
and ex vivo. For ex vivo therapy, vectors may be
introduced into stem cells taken from the patient and
clonally propagated for autologous transplant back into
that same patient. Delivery by transfection, by liposome
injections, or by polycationic amino polymers may be
achieved using methods which are well known in the art.
(For example, see Goldman et al., 1997).

Additional antisense or gene-targeted silencing
strategies may include, but are not limited to, the use of
antisense oligonucleotides, injection of antisense RNA,
transfection of antisense RNA expression vectors, and the
use of RNA interference (RNAi) or short interfering RNAs
(siRNA). Still further, catalytic nucleic acid molecules
such as DNAzymes and ribozymes may be used for gene
silencing (Breaker and Joyce, 1994; Haseloff and Gerlach,
1988). These molecules function by cleaving their target
mRNA molecule rather than merely binding to it as in
traditional antisense approaches.

In a further aspect, a suitable agonist, antagonist
or modulator may include peptides, phosphopeptides or
small organic or inorganic compounds that can restore
wild-type activity of ion channels containing mutations in
the subunits which comprise the channels as described
above.

Peptides, phosphopeptides or small organic or
inorganic compounds suitable for therapeutic applications
may be identified using nucleic acids and peptides of the
invention in drug screening applications as described
below. Molecules identified from these screens may also be
of therapeutic application in affected individuals
carrying other ion channel subunit gene mutations if the
molecule is able to correct the common underlying
functional deficit imposed by these mutations and those of
the invention.

There is therefore provided a method of treating
epilepsy as well as other disorders associated with ion
channel dysfunction comprising administering a compound
that is a suitable agonist, antagonist or modulator of an
ion channel and that has been identified using the mutant
ion channel subunits of the invention.

In some instances, an appropriate approach for
treatment may be combination therapy. This may involve
administering an antibody or complement (antisense) to a
mutant ion channel or ion channel subunit of the invention
to inhibit its functional effect, combined with
administration of wild-type ion channel subunits which may
restore levels of wild-type ion channel formation to
normal levels. Wild-type ion channel subunits of the
invention can be administered using gene therapy
approaches as described above for complement
administration.

There is therefore provided a method of treating
epilepsy as well as other disorders associated with ion
channel dysfunction comprising administration of an
antibody or complement to a mutant ion channel or ion
channel subunit of the invention in combination with
administration of wild-type ion channel subunits.

In still another aspect of the invention there is
provided the use of an antibody or complement to a mutant
ion channel or ion channel subunit of the invention in
combination with the use of wild-type ion channel
subunits, in the manufacture of a medicament for the
treatment of epilepsy as well as other disorders
associated with ion channel dysfunction.

In further embodiments, any of the agonists,
antagonists, modulators, antibodies, complementary
sequences or vectors of the invention may be administered
in combination with other appropriate therapeutic agents.
Selection of the appropriate agents may be made by those
skilled in the art, according to conventional
pharmaceutical principles. The combination of therapeutic
agents may act synergistically to effect the treatment or
prevention of the various disorders described above. Using
this approach, therapeutic efficacy with lower dosages of
each agent may be possible, thus reducing the potential
for adverse side effects.

Any of the therapeutic methods described above may be
applied to any subject in need of such therapy, including,
for example, mammals such as dogs, cats, cows, horses,
rabbits, monkeys, and most preferably, humans.

Drug Screening 

According to still another aspect of the invention,
nucleic acid molecules of the invention as well as
peptides of the invention, particularly purified mutant
ion channel subunit polypeptide and cells expressing
these, are useful for the screening of candidate
pharmaceutical agents for the treatment of epilepsy as
well as other disorders associated with ion
channel dysfunction, including but not restricted to,
hyper- or hypo-kalemic periodic paralysis, myotonias,
malignant hyperthermia, myasthenia, cardiac arrhythmias,
episodic ataxia, migraine, Alzheimer's disease,
Parkinson's disease, schizophrenia, hyperekplexia,
anxiety, depression, phobic obsessive symptoms,
neuropathic pain, inflammatory pain, chronic/acute pain,
Bartter's syndrome, polycystic kidney disease, Dent's
disease, hyperinsulinemic hypoglycemia of infancy, cystic
fibrosis, congenital stationary night blindness or total
colour-blindness.

Still further, it provides the use of a polypeptide
complex for the screening of candidate pharmaceutical
compounds.

Still further, it provides the use wherein high
throughput screening techniques are employed.

Compounds that can be screened in accordance with the
invention include, but are not limited to, peptides (such
as soluble peptides), phosphopeptides and small organic or
inorganic molecules (such as natural product or synthetic
chemical libraries and peptidomimetics).

In one embodiment, a screening assay may include a
cell-based assay utilising eukaryotic or prokaryotic host
cells that are stably transformed with recombinant
molecules expressing the polypeptides or fragments of the
invention, in competitive binding assays. Binding assays
will measure the formation of complexes between a specific
mutant ion channel subunit polypeptide or ion channel
incorporating a mutant ion channel subunit polypeptide,
and the compound being tested, or will measure the degree
to which a compound being tested will inhibit or restore
the formation of a complex between a specific mutant ion
channel subunit polypeptide or ion channel incorporating a
mutant ion channel subunit polypeptide, and its interactor
or ligand.

The invention is particularly useful for screening
compounds by using the polypeptides of the invention in
transformed cells, transfected or injected oocytes, or
animal models bearing mutated ion channel subunits such as
transgenic animals or gene targeted (knock-in) animals
(see transformed hosts). Drug candidates can be added to
cultured cells that express a single mutant ion channel
subunit or combination of mutant ion channel subunits
(appropriate wild-type ion channel subunits should also be
expressed for receptor assembly), can be added to oocytes
transfected or injected with either a mutant ion channel
subunit or combination of mutant ion channel subunits
(appropriate wild-type ion channel subunits must also be
injected for receptor assembly), or can be administered to
an animal model containing a mutant ion channel or
combination of mutant ion channels. Determining the
ability of the test compound to modulate mutant ion
channel activity can be accomplished by a number of
techniques known in the art. These include for example
measuring the effect on the current of the channel (e.g.
calcium-, chloride-, sodium-, potassium-ion flux) as
compared to the current of a cell or animal containing
wild-type ion channels. Current in cells can be measured
by a number of approaches including the patch-clamp
technique (methods described in Hamill et al., 1981) or
using fluorescence based assays as are known in the art
(see Gonzalez et al., 1999). Drug candidates that alter the
current to a more normal level are useful for treating or
preventing epilepsy as well as other disorders associated
with ion channel dysfunction.
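
The comparison of currents described above can be expressed as a
simple calculation. The following is an illustrative sketch only:
the function names, the "rescue fraction" measure and the example
current values are assumptions made for the example and are not
taken from the invention or from any measured data.

```python
def moves_toward_wild_type(wild_type, mutant_untreated, mutant_treated):
    """True if the treated mutant current is closer to the wild-type
    current than the untreated mutant current is."""
    return abs(mutant_treated - wild_type) < abs(mutant_untreated - wild_type)


def rescue_fraction(wild_type, mutant_untreated, mutant_treated):
    """Fraction of the mutant deficit (or excess) corrected by treatment.

    0.0 means no change, 1.0 means full restoration to the wild-type
    level, and values above 1.0 indicate over-correction.
    """
    deficit = wild_type - mutant_untreated
    if deficit == 0:
        raise ValueError("mutant current already equals wild-type current")
    return (mutant_treated - mutant_untreated) / deficit


# Invented example values in arbitrary current units.
print(moves_toward_wild_type(100.0, 40.0, 85.0))
print(rescue_fraction(100.0, 40.0, 85.0))
```

Both functions treat loss of current and excess current alike,
since either may represent a departure from the wild-type level.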

Non cell-based assays may . also be used for 
identifying compounds that can inhibit or restore binding 
between the polypeptides of the invention or ion channels 
incorporating the polypeptides of the invention, and their 
interactors. Such assays are known in the art and include 
for example AlphaScreen technology (PerkinElmer Life 
Sciences, MA, USA) . This application relies on the use of 
beads such that each interaction partner is bound to a 
separate bead via an antibody. Interaction of each partner 
will bring the beads into proximity, such that laser 
excitation initiates a number of chemical reactions 
ultimately leading to fluorophores emitting a light 
signal. Candidate compounds that inhibit the binding of 
the' mutant ion channel subunit, or ion channel 
incorporating the mutant subunit, with its interactor will 
result in loss of light emission, while candidate 
compounds that restore the binding of the mutant ion 
channel subunit, or ion channel incorporating the mutant 
subunit, with its interactor will result in positive light 
emission. These assays ultimately enable identification 
and isolation of the candidate compounds. 
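
The read-out logic described above (loss of light emission
for inhibitors, gain of light emission for compounds that
restore binding) can be sketched as a simple normalisation
against an untreated control well. The function names, the
thresholds and the signal counts are hypothetical and are
not part of any instrument software.

```python
def percent_of_control(signal, control_signal, background):
    """Express a well's light signal as a percentage of the untreated control."""
    span = control_signal - background
    if span <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (signal - background) / span


def classify(signal, control_signal, background, low=50.0, high=150.0):
    """Label a compound by its effect on the binding signal.

    `low` and `high` are assumed cut-offs, chosen only for illustration.
    """
    pct = percent_of_control(signal, control_signal, background)
    if pct <= low:
        return "candidate inhibitor"
    if pct >= high:
        return "candidate restorer"
    return "no clear effect"


# Hypothetical counts: untreated control 1000, background 100.
print(classify(200, 1000, 100))   # signal well below the control
print(classify(2000, 1000, 100))  # signal well above the control
print(classify(1000, 1000, 100))  # signal equal to the control
```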

High-throughput drug screening techniques may also
employ methods as described in WO84/03564. Small peptide
test compounds synthesised on a solid substrate can be 
assayed for mutant ion channel subunit polypeptide or 
mutant ion channel binding. Bound mutant ion channel or 
mutant ion channel subunit polypeptide is then detected by 
methods well known in the art. In a variation of this 
technique, purified polypeptides of the invention can be
coated directly onto plates to identify interacting test
compounds.

The invention also contemplates the use of
competition drug screening assays in which neutralizing
antibodies capable of specifically binding the mutant ion
channel compete with a test compound for binding thereto.
In this manner, the antibodies can be used to detect the
presence of any peptide that shares one or more antigenic
determinants of the mutant ion channel.

The polypeptides of the present invention may also be
used for screening compounds developed as a result of
combinatorial library technology. This provides a way to
test a large number of different substances for their
ability to modulate activity of a polypeptide. A substance
identified as a modulator of polypeptide function may be
peptide or non-peptide in nature. Non-peptide "small
molecules" are often preferred for many in vivo
pharmaceutical applications. In addition, a mimic or
mimetic of the substance may be designed for
pharmaceutical use. The design of mimetics based on a
known pharmaceutically active compound ("lead" compound)
is a common approach to the development of novel
pharmaceuticals. This is often desirable where the
original active compound is difficult or expensive to
synthesise or where it provides an unsuitable method of
administration. In the design of a mimetic, particular
parts of the original active compound that are important
in determining the target property are identified. These
parts or residues constituting the active region of the
compound are known as its pharmacophore. Once found, the
pharmacophore structure is modelled according to its
physical properties using data from a range of sources
including x-ray diffraction data and NMR. A template
molecule is then selected onto which chemical groups which
mimic the pharmacophore can be added. The selection can be
made such that the mimetic is easy to synthesise, is
likely to be pharmacologically acceptable, does not
degrade in vivo and retains the biological activity of the
lead compound. Further optimisation or modification can be
carried out to select one or more final mimetics useful
for in vivo or clinical testing.
It is also possible to isolate a target-specific
antibody and then solve its crystal structure. In
principle, this approach yields a pharmacophore upon which
subsequent drug design can be based as described above. It
may be possible to avoid protein crystallography
altogether by generating anti-idiotypic antibodies
(anti-ids) to a functional, pharmacologically active antibody.
As a mirror image of a mirror image, the binding site of
the anti-ids would be expected to be an analogue of the
original receptor. The anti-id could then be used to
isolate peptides from chemically or biologically produced
peptide banks.

Another alternative method for drug screening relies 
on structure-based rational drug design. Determination of 
the three dimensional structure of the polypeptides of the
invention, or the three dimensional structure of the ion
channels which incorporate these polypeptides, allows for
structure-based drug design to identify biologically
active lead compounds.

Three dimensional structural models can be generated
by a number of applications, some of which include
experimental models such as x-ray crystallography and NMR
and/or from in silico studies of structural databases such
as the Protein Databank (PDB). In addition, three
dimensional structural models can be determined using a
number of known protein structure prediction techniques
based on the primary sequences of the polypeptides (e.g.
SYBYL - Tripos Associates, St. Louis, MO), de novo protein
structure design programs (e.g. MODELER - MSI Inc., San
Diego, CA, or MOE - Chemical Computing Group, Montreal,
Canada) or ab initio methods (e.g. see US Patent Numbers
5331573 and 5579250).

Once the three dimensional structure of a polypeptide
or polypeptide complex has been determined, structure-based
drug discovery techniques can be employed to design
biologically-active compounds based on these three
dimensional structures. Such techniques are known in the
art and include examples such as DOCK (University of
California, San Francisco) or AUTODOCK (Scripps Research
Institute, La Jolla, California). A computational docking
protocol will identify the active site or sites that are
deemed important for protein activity based on a predicted
protein model. Molecular databases, such as the Available
Chemicals Directory (ACD), are then screened for molecules
that complement the protein model.

Using methods such as these, potential clinical drug
candidates can be identified and computationally ranked in
order to reduce the time and expense associated with
typical 'wet lab' drug screening methodologies.
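
Computational ranking of candidates can be sketched as a
sort over docking scores. The sketch assumes a score for
which more negative values indicate better predicted
binding (as with an estimated binding energy); the record
type, compound identifiers and scores are invented for
illustration and do not reflect the output format of any
particular docking program.

```python
from typing import List, NamedTuple


class DockingResult(NamedTuple):
    compound_id: str
    score: float  # assumed convention: more negative = better predicted binding


def rank_candidates(results: List[DockingResult], top_n: int = 3) -> List[DockingResult]:
    """Return the `top_n` best-scoring compounds, best first."""
    return sorted(results, key=lambda r: r.score)[:top_n]


# Hypothetical screening results.
screen = [
    DockingResult("cmpd-001", -6.2),
    DockingResult("cmpd-002", -9.4),
    DockingResult("cmpd-003", -3.1),
    DockingResult("cmpd-004", -8.0),
]

for result in rank_candidates(screen, top_n=2):
    print(result.compound_id, result.score)
```

Only the short list returned by such a ranking would then
be carried forward to laboratory assays.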

Compounds identified through screening procedures as
described above, and which are based on the use of the
mutant nucleic acid and polypeptides of the invention, can
also be tested for their effect on correcting the
functional deficit imposed by other gene mutations in
affected individuals including other ion channel subunit
mutations.

Such compounds form a part of the present invention,
as do pharmaceutical compositions containing these and a
pharmaceutically acceptable carrier. 

Pharmaceutical Preparations 

Compounds identified from screening assays and shown
to restore ion channel wild-type activity can be
administered to a patient at a therapeutically effective
dose to treat or ameliorate epilepsy as well as other
disorders associated with ion channel dysfunction, as
described above. A therapeutically effective dose refers
to that amount of the compound sufficient to result in
amelioration of symptoms of the disorder.

Toxicity and therapeutic efficacy of such compounds
can be determined by standard pharmaceutical procedures in
cell cultures or experimental animals. The data obtained
from these studies can then be used in the formulation of
a range of dosages for use in humans.

Pharmaceutical compositions for use in accordance
with the present invention can be formulated in a
conventional manner using one or more physiologically
acceptable carriers, excipients or stabilisers which are
well known. Acceptable carriers, excipients or stabilisers
are non-toxic at the dosages and concentrations employed,
and include buffers such as phosphate, citrate, and other
organic acids; antioxidants including ascorbic acid; low
molecular weight (less than about 10 residues)
polypeptides; proteins, such as serum albumin, gelatin, or
immunoglobulins; binding agents including hydrophilic
polymers such as polyvinylpyrrolidone; amino acids such as
glycine, glutamine, asparagine, arginine or lysine;
monosaccharides, disaccharides, and other carbohydrates
including glucose, mannose, or dextrins; chelating agents
such as EDTA; sugar alcohols such as mannitol or sorbitol;
salt-forming counter ions such as sodium; and/or non-ionic
surfactants such as Tween, Pluronics or polyethylene
glycol (PEG).

The formulation of pharmaceutical compositions for
use in accordance with the present invention will be based
on the proposed route of administration. Routes of
administration may include, but are not limited to,
inhalation, insufflation (either through the mouth or
nose), oral, buccal, rectal or parenteral administration.

Diagnostic Applications 

Polynucleotide sequences encoding an ion channel
subunit may be used for the diagnosis of epilepsy, as well
as other disorders associated with ion channel
dysfunction, including but not restricted to, hyper- or
hypo-kalemic periodic paralysis, myotonias, malignant
hyperthermia, myasthenia, cardiac arrhythmias, episodic
ataxia, migraine, Alzheimer's disease, Parkinson's
disease, schizophrenia, hyperekplexia, anxiety,
depression, phobic obsessive symptoms, neuropathic pain,
inflammatory pain, chronic/acute pain, Bartter's syndrome,
polycystic kidney disease, Dent's disease,
hyperinsulinemic hypoglycemia of infancy, cystic fibrosis,
congenital stationary night blindness or total
colour-blindness, and the use of the nucleic acid molecules
incorporated as part of the invention in diagnosis of
these disorders, or a predisposition to these disorders,
is therefore contemplated. The nucleic acid molecules
incorporating the novel mutation events laid out in Table
1 may be used for this purpose.

The polynucleotides that may be used for diagnostic
purposes include oligonucleotide sequences, genomic DNA
and complementary RNA and DNA molecules. The
polynucleotides may be used to detect and quantitate gene
expression in biological samples. Genomic DNA used for the
diagnosis may be obtained from body cells, such as those
present in the blood, tissue biopsy, surgical specimen, or
autopsy material. The DNA may be isolated and used
directly for detection of a specific sequence or may be
amplified by the polymerase chain reaction (PCR) prior to
analysis. Similarly, RNA or cDNA may also be used, with or
without PCR amplification. To detect a specific nucleic
acid sequence, hybridisation using specific
oligonucleotides, restriction enzyme digest and mapping,
PCR mapping, RNAse protection, and various other methods
may be employed. Oligonucleotides specific to particular
sequences can be chemically synthesized and labelled
radioactively or nonradioactively and hybridised to
individual samples immobilized on membranes or other
solid supports or in solution. The presence, absence or
excess expression of any one of the mutant ion channel
genes of the invention may then be visualized using
methods such as autoradiography, fluorometry, or
colorimetry.

In a further diagnostic approach, the nucleotide
sequences of the invention may be useful in assays that
detect the presence of associated disorders, particularly
those mentioned previously. The nucleotide sequences may
be labelled by standard methods and added to a fluid or
tissue sample from a patient under conditions suitable for
the formation of hybridisation complexes. After a suitable
incubation period, the sample is washed and the signal is
quantitated and compared with a standard value. If the
amount of signal in the patient sample is significantly
altered in comparison to a control sample then the
presence of altered levels of nucleotide sequences in the
sample indicates the presence of the associated disorder.
Such assays may also be used to evaluate the efficacy of a
particular therapeutic treatment regimen in animal
studies, in clinical trials, or to monitor the treatment
of an individual patient.

In order to provide a basis for the diagnosis or
prognosis of epilepsy and other disorders as described
above, which are associated with the ion channel subunit
mutations or variants of the invention, the nucleotide
sequence of each gene can be compared between normal
tissue and diseased tissue in order to establish whether
the patient expresses a mutant gene.

In order to provide a basis for the diagnosis of a
disorder associated with abnormal expression of an ion
channel subunit gene of the invention, a normal or
standard profile for expression is established. This may
be accomplished by combining body fluids or cell extracts
taken from normal subjects, either animal or human, with a
sequence, or a fragment thereof, encoding the relevant ion
channel subunit gene, under conditions suitable for
hybridisation or amplification. Standard hybridisation may
be quantified by comparing the values obtained from normal
subjects with values from an experiment in which a known
amount of a substantially purified polynucleotide is used.
Another method to identify a normal or standard profile
for expression of an ion channel subunit gene is through
quantitative RT-PCR studies. RNA isolated from body cells
of a normal individual is reverse transcribed and
real-time PCR using oligonucleotides specific for the relevant
gene is conducted to establish a normal level of
expression of the gene. Standard values obtained in both
these examples may be compared with values obtained from
samples from patients who are symptomatic for a disorder.
Deviation from standard values is used to establish the
presence of a disorder.
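
One common way of expressing a deviation from a standard
value in quantitative RT-PCR is the comparative threshold
cycle (2^-ddCt) calculation, sketched below. The
specification does not prescribe this calculation; the use
of a reference gene, the assumption of approximately 100%
amplification efficiency, the two-fold cut-off and all Ct
values are assumptions made for the example.

```python
def fold_change(ct_target_patient, ct_ref_patient,
                ct_target_normal, ct_ref_normal):
    """Relative expression of the target gene, patient versus normal.

    Each sample is first normalised to a reference gene (delta Ct),
    then the patient is compared with the normal standard
    (delta-delta Ct). Assumes product doubles every cycle.
    """
    delta_patient = ct_target_patient - ct_ref_patient
    delta_normal = ct_target_normal - ct_ref_normal
    return 2.0 ** -(delta_patient - delta_normal)


def deviates_from_standard(fold, threshold=2.0):
    """True if expression is at least `threshold`-fold up or down."""
    return fold >= threshold or fold <= 1.0 / threshold


# Hypothetical threshold-cycle values.
fold = fold_change(ct_target_patient=26.0, ct_ref_patient=18.0,
                   ct_target_normal=24.0, ct_ref_normal=18.0)
print(fold, deviates_from_standard(fold))
```

In this sketch the patient's target gene crosses threshold
two cycles later than the standard after normalisation,
which corresponds to roughly one quarter of the normal
expression level.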

Once the presence of a disorder is established and a
treatment protocol is initiated, hybridisation assays or
quantitative RT-PCR studies may be repeated on a regular
basis to determine if the level of expression in the
patient begins to approximate that which is observed in
the normal subject. The results obtained from successive
assays may be used to show the efficacy of treatment over
a period ranging from several days to months.

According to a further aspect of the invention there
is provided the use of a polypeptide as described above in
the diagnosis of epilepsy as well as other disorders
associated with ion channel dysfunction, including but not
restricted to, hyper- or hypo-kalemic periodic paralysis,
myotonias, malignant hyperthermia, myasthenia, cardiac
arrhythmias, episodic ataxia, migraine, Alzheimer's
disease, Parkinson's disease, schizophrenia,
hyperekplexia, anxiety, depression, phobic obsessive
symptoms, neuropathic pain, inflammatory pain,
chronic/acute pain, Bartter's syndrome, polycystic kidney
disease, Dent's disease, hyperinsulinemic hypoglycemia of
infancy, cystic fibrosis, congenital stationary night
blindness or total colour-blindness.

When a diagnostic assay is to be based upon proteins
constituting an ion channel, a variety of approaches are
possible. For example, diagnosis can be achieved by
monitoring differences in the electrophoretic mobility of
normal and mutant proteins that form the ion channel. Such
an approach will be particularly useful in identifying
mutants in which charge substitutions are present, or in
which insertions, deletions or substitutions have resulted
in a significant change in the electrophoretic migration
of the resultant protein. Alternatively, diagnosis may be
based upon differences in the proteolytic cleavage
patterns of normal and mutant proteins, differences in
molar ratios of the various amino acid residues, or by
functional assays demonstrating altered function of the
gene products.

In another aspect, antibodies that specifically bind
mutant ion channels may be used for the diagnosis of a
disorder, or in assays to monitor patients being treated
with a complete ion channel or agonists, antagonists,
modulators or inhibitors of an ion channel. Antibodies
useful for diagnostic purposes may be prepared in the same
manner as described above for therapeutics. Diagnostic
assays for ion channels include methods that utilise the
antibody and a label to detect a mutant ion channel in
human body fluids or in extracts of cells or tissues. The
antibodies may be used with or without modification, and
may be labelled by covalent or non-covalent attachment of
a reporter molecule.

A variety of protocols for measuring the presence of
mutant ion channels, including but not restricted to,
ELISAs, RIAs, and FACS, are known in the art and provide a
basis for diagnosing a disorder. The expression of a
mutant ion channel or combination of mutant ion channels
is established by combining body fluids or cell extracts
taken from test mammalian subjects, preferably human, with
antibody to the ion channel or channels under conditions
suitable for complex formation. The amount of complex
formation may be quantitated by various methods,
preferably by photometric means. Antibodies specific for
the mutant ion channels will only bind in samples from
individuals expressing the said mutant ion channels and not
in samples from individuals expressing only wild-type
channels (i.e. normal individuals). This establishes the
basis for diagnosing the disorder.

Once an individual has been diagnosed with a
disorder, effective treatments can be initiated as
described above. Treatments can be directed to amend the 
combination of ion channel subunit mutations or may be 
directed to one mutation. 

Microarray 

In further embodiments, complete cDNAs,
oligonucleotides or longer fragments derived from any of
the polynucleotide sequences described herein may be used
as probes in a microarray. The microarray can be used to
diagnose epilepsy, as well as other disorders associated
with ion channel dysfunction, through the identification
of genetic variants, mutations, and polymorphisms in the
ion channel subunits that form part of the invention, to
understand the genetic basis of a disorder, or can be used
to develop and monitor the activities of therapeutic
agents.

According to a further aspect of the present
invention, tissue material obtained from animal models
generated as a result of the identification of specific
ion channel subunit human mutations (see below),
particularly those disclosed in the present invention, can
be used in microarray experiments. These experiments can
be conducted to identify the level of expression of
specific ion channel subunits, or any cDNA clones from
whole-tissue libraries, in diseased tissue as opposed to
normal control tissue. Variations in the expression level
of genes, including ion channel subunits, between the two
tissues indicate their possible involvement in the
disease process either as a cause or consequence of the
original ion channel subunit mutation present in the
animal model. These experiments may be used to determine
gene function, to understand the genetic basis of a
disorder, to diagnose a disorder, and to develop and
monitor the activities of therapeutic agents. Microarrays
may be prepared, used, and analyzed using methods known in
the art (for example, see Schena et al., 1996; Heller et
al., 1997).

Transformed Hosts 

The present invention also provides for the
production of genetically modified (knock-out, knock-in
and transgenic), non-human animal models transformed with
nucleic acid molecules containing the novel ion channel
mutations or variants as laid out in Table 1. These
animals are useful for the study of the function of ion
channels, to study the mechanisms by which combinations of
mutations in ion channel subunits interact to give rise to
disease and the effects of these mutations on tissue
development, for the screening of candidate pharmaceutical
compounds, for the creation of explanted mammalian cell
cultures which express mutant ion channels or combinations
of mutant ion channels, and for the evaluation of
potential therapeutic interventions.

Animal species which are suitable for use in the
animal models of the present invention include, but are
not limited to, rats, mice, hamsters, guinea pigs,
rabbits, dogs, cats, goats, sheep, pigs, and non-human
primates such as monkeys and chimpanzees. For initial
studies, genetically modified mice and rats are highly
desirable due to the relative ease in generating knock-in,
knock-out or transgenics of these animals, their ease of
maintenance and their shorter life spans. For certain
studies, transgenic yeast or invertebrates may be suitable
and preferred because they allow for rapid screening and
provide for much easier handling. For longer term studies,
non-human primates may be desired due to their similarity
with humans.

To create an animal model for a mutated ion channel,
or an animal model incorporating a combination of
mutations, several methods can be employed. These include,
but are not limited to, generation of a specific mutation
in a homologous animal gene, insertion of a wild type
human gene and/or a humanized animal gene by homologous
recombination, insertion of a mutant (single or multiple)
human gene as genomic or mini-gene cDNA constructs using
wild type or mutant or artificial promoter elements, or
insertion of artificially modified fragments of the
endogenous gene by homologous recombination. The
modifications include insertion of mutant stop codons, the
deletion of DNA sequences, or the inclusion of
recombination elements (loxP sites) recognized by enzymes
such as Cre recombinase.

To create transgenic mice in order to study gain of
gene function in vivo, any mutant ion channel subunit gene
of the invention can be inserted into a mouse germ line
using standard techniques such as oocyte microinjection.
Gain of gene function can mean the over-expression of a
gene and its protein product, or the genetic
complementation of a mutation of the gene under
investigation. For oocyte injection, one or more copies of
the mutant gene can be inserted into the pronucleus of a
just-fertilized mouse oocyte. This oocyte is then
reimplanted into a pseudo-pregnant foster mother. The
live-born mice can then be screened for integrants using
analysis of tail DNA for the presence of the relevant
human ion channel subunit gene sequence. The transgene can
be either a complete genomic sequence injected as a YAC,
BAC, PAC or other chromosome DNA fragment, a cDNA with
either the natural promoter or a heterologous promoter, or
a mini-gene containing all of the coding region and other
elements found to be necessary for optimum expression.

To generate knock-out mice or knock-in mice, gene
targeting through homologous recombination in mouse
embryonic stem (ES) cells may be applied. Knock-out mice
are generated to study loss of gene function in vivo while
knock-in mice (which are preferred) allow the study of
gain of function or of the effect of specific gene
mutations. Knock-in mice are similar to transgenic mice;
however, the integration site and copy number are defined
in the former.

For knock-out mouse generation, gene targeting
vectors can be designed such that they delete (knock-out)
the protein coding sequence of the relevant ion channel
subunit gene in the mouse genome. In contrast, knock-in
mice can be produced whereby a gene targeting vector
containing the relevant ion channel subunit gene can
integrate into a defined genetic locus in the mouse
genome. For both applications, homologous recombination is
catalysed by specific DNA repair enzymes that recognise
homologous DNA sequences and exchange them via double
crossover.

Gene targeting vectors are usually introduced into ES
cells using electroporation. ES cell integrants are then
isolated via an antibiotic resistance gene present on the
targeting vector and are subsequently genotyped to
identify those ES cell clones in which the gene under
investigation has integrated into the locus of interest.
The appropriate ES cells are then transmitted through the
germline to produce a novel mouse strain.

In instances where gene ablation results in early
embryonic lethality, conditional gene targeting may be
employed. This allows genes to be deleted in a temporally
and spatially controlled fashion. As above, appropriate ES
cells are transmitted through the germline to produce a
novel mouse strain; however, the actual deletion of the
gene is performed in the adult mouse in a tissue specific
or time controlled manner. Conditional gene targeting is
most commonly achieved by use of the cre/lox system. The
enzyme cre is able to recognise the 34 base pair loxP
sequence such that loxP flanked (or floxed) DNA is
recognised and excised by cre. Tissue specific cre
expression in transgenic mice enables the generation of
tissue specific knock-out mice by mating gene targeted
floxed mice with cre transgenic mice. Knock-out can be
conducted in every tissue (Schwenk et al., 1995) using the
'deleter' mouse or using transgenic mice with an inducible
cre gene (such as those with tetracycline inducible cre
genes), or knock-out can be tissue specific for example
through the use of the CD19-cre mouse (Rickert et al.,
1997).
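
The logic of conditional deletion can be illustrated with a
toy string model in which DNA lying between two loxP sites
in the same orientation is removed only where cre is
expressed, leaving a single loxP site behind. This is a
schematic sketch, not a model of the recombination
chemistry: the flanking and exon sequences and the tissue
names are placeholders, and the loxP string is the commonly
cited 34 base pair sequence.

```python
# Commonly cited 34 bp loxP site: two 13 bp inverted repeats
# flanking an 8 bp spacer.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"


def cre_excise(sequence, cre_expressed):
    """Return the sequence after cre-mediated excision of floxed DNA.

    If cre is expressed and two loxP sites are present, everything from
    the first site up to (not including) the second site is removed, so
    one loxP site remains. Otherwise the sequence is returned unchanged.
    """
    if not cre_expressed:
        return sequence
    first = sequence.find(LOXP)
    if first == -1:
        return sequence
    second = sequence.find(LOXP, first + len(LOXP))
    if second == -1:
        return sequence
    return sequence[:first] + sequence[second:]


# Placeholder allele: upstream flank, floxed "exon", downstream flank.
floxed_allele = "GGGG" + LOXP + "ACGTACGT" + LOXP + "CCCC"

# Hypothetical tissues differing only in whether cre is expressed.
cre_by_tissue = {"tissue_with_cre": True, "tissue_without_cre": False}
for tissue, expressed in cre_by_tissue.items():
    allele = cre_excise(floxed_allele, expressed)
    print(tissue, "exon retained:", "ACGTACGT" in allele)
```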

Once knock-in animals have been produced which
contain a specific mutation in a particular ion channel 
subunit, mating combinations may be initiated between such 
animals so as to produce progeny containing combinations 
of two or more ion channel mutations. These animals 
effectively mimic combinations of mutations that are 
proposed to cause human IGE cases. These animal models can 
subsequently be used to study the extent and mechanisms of 
disease as related to the mutated ion channel 
combinations, as well as for the screening of candidate 
therapeutic compounds.

According to still another aspect of the invention
there is provided the use of genetically modified
non-human animals as described above for the screening of
candidate pharmaceutical compounds (see drug screening
above). These animals are also useful for the evaluation
(e.g. therapeutic efficacy, toxicity, metabolism) of
candidate pharmaceutical compounds, including those
identified from the invention as described above, for the
treatment of epilepsy as well as other disorders
associated with ion channel dysfunction as described
above.

It will be clearly understood that, although a number
of prior art publications are referred to herein, this 
reference does not constitute an admission that any of 
these documents forms part of the common general knowledge 
in the art, in Australia or in any other country. 

Throughout this specification and the claims, the
words "comprise", "comprises" and "comprising" are used in
a non-exclusive sense, except where the context requires
otherwise.

It will be apparent to the person skilled in the art
that while the invention has been described in some detail
for the purposes of clarity and understanding, various
modifications and alterations to the embodiments and
methods described herein may be made without departing
from the scope of the inventive concept disclosed in this
specification.

Brief Description of the Drawings 

Preferred forms of the invention will now be
described, by way of example only, with reference to the
following examples and the accompanying drawings, in
which:

Figure 1 provides an example of ion channel subunit
stoichiometry and the effect of multiple versus single ion
channel subunit mutations. Figure 1A: A typical channel
may have five subunits of three different types. Figure
1B: In outbred populations complex diseases such as
idiopathic generalized epilepsies may be due to mutations
in two (or more) different subunit genes. Because only one
allele of each subunit gene is abnormal, half the
expressed subunits will have the mutation. Figure 1C: In
inbred populations, both alleles of a single subunit gene
will be affected, so all expressed subunits will be
mutated. Figure 1D: Autosomal dominant disorders can be
attributed to single ion channel subunit mutations that
give rise to severe functional consequences.

Figure 2 represents the location of mutations 
identified in the KCNQ2 ion channel subunit constituting 
the potassium channel. M: Missense mutation; T: Truncation 
mutation; F: Frameshift mutation; S: Splice site mutation. 

Figure 3 provides examples of epilepsy pedigrees
where mutation profiles of ion channel subunits for
individuals constituting the pedigree have begun to be
determined. These examples have been used to illustrate
how the identification of novel ion channel subunit
mutations and variations in IGE individuals can combine to
give rise to the disorder.

Figure 4 shows the results of yeast two-hybrid
analysis of R353G and L619R KCNQ2 mutants. Yeast were
transformed with the empty DB (BAIT) plasmid (DBLeu),
DB-Q2C wt, DB-Q2C R353G mutant or the DB-Q2 L619R mutant as
indicated in A and the AD-CaM (TARGET) vector was
introduced by gap-repair. Yeast control strains
(Invitrogen™) were included on all plates for comparison.
Control 1 has no interaction. Control 2 has a weak
interaction. Control 3 has a moderately strong
interaction. Control 4 has a strong interaction and
control 5 has a very strong interaction. B. Growth of
transformed yeast and controls on -leu -tryp selection.
Yeast can grow on -leu if they contain the DB plasmid, and
-tryp if they have AD plasmid. C. Growth of transformed
yeast and controls on -leu -tryp -his +40mM 3AT after
48hrs. Yeast can grow on -his+3AT if the his reporter gene
is activated by interaction between the BAIT and TARGET
plasmids. D-F. LacZ filter assay for interaction between
BAIT and TARGET plasmids, photos taken after 2hrs (D),
7hrs (E) and 24hrs (F). Activation of the β-galactosidase
reporter gene by interaction of the BAIT and TARGET
plasmids leads to the dark appearance of colonies.

Figure 5 shows the results of CaM affinity
experiments with the R353G and L619R KCNQ2 mutants. The
chart below shows the values from the CPRG assay for
β-galactosidase activity as a measure of KCNQ2C-CaM binding
efficiency. The area of each bar in the chart equates to
the CaM binding efficiency of the BAIT. Broken lines
indicate statistical comparison by Student's t test:
* P<0.01, ** P<0.001.

Modes for Performing the Invention

Potassium channels are the most diverse class of ion
channel. The C. elegans genome encodes about 80 different
potassium channel genes and there are probably more in
mammals. About ten potassium channel genes are known to be
mutated in human disease and include four members of the
KCNQ gene sub-family of potassium channels. KCNQ proteins
have six transmembrane domains, a single P-loop that forms
the selectivity filter of the pore, a positively charged
fourth transmembrane domain that probably acts as a
voltage sensor, and intracellular amino and carboxy
termini. The C-terminus is long and contains a conserved
"A domain" followed by a short stretch thought to be
involved in subunit assembly.

Four KCNQ subunits are thought to combine to form a
functional potassium channel. All five known KCNQ proteins
can form homomeric channels in vitro and the formation of
heteromers appears to be restricted to certain
combinations. For instance KCNQ2 and KCNQ3, which are
predominantly expressed in the central nervous system,
form a heteromultimeric channel that mediates the neuronal
muscarinic-regulated current (M-current), also known as
the M-channel (or M-type K+ channel). The M-current is a
slowly activating, non-inactivating potassium conductance
known to regulate neuronal excitability by determining the
firing properties of neurons and their responsiveness to
synaptic input (Wang et al., 1998). Because it is the only
current active at voltages near the threshold for action
potential initiation, the M-current has a major impact on
neuronal excitability.

Sodium (the alpha subunit) and calcium channels are
thought to have evolved from the potassium channel
subunit, and they each consist of four domains covalently
linked as the one molecule, each domain being equivalent
to one of the subunits that associate to form the
potassium channel. Each of the four domains of the sodium
and calcium channels comprises six transmembrane
segments.

Voltage-gated sodium channels are required to
generate the electrical excitation in neurones, heart and
skeletal muscle fibres, which express tissue-specific
isoforms. Sodium channels are heteromers of a pore-forming
alpha subunit and a modulatory beta-1 subunit, with an
additional beta-2 subunit in neuronal channels. Ten genes
encoding sodium channel alpha subunits and 3 genes
encoding different beta subunits have so far been
identified. The beta subunits of the sodium channels do
not associate with the alpha subunits to form any part of
the pore; they do, however, affect the way the alpha
pore-forming subunit functions.

As with sodium channels, calcium channels consist of
a single pore-forming alpha subunit, of which at least six
types have been identified to date, and several accessory
subunits including four beta, one gamma and one
alpha2-delta gene. Many of these subunit genes also encode
multiple splice variants, adding to the diversity of
receptor subunits of this family of ion channels.

The ion channels in the nAChR/GABA super family show
a theoretical pentameric channel. Gamma-aminobutyric acid
(GABA) is the most abundant inhibitory neurotransmitter in
the central nervous system. GABA-ergic inhibition is
mediated by two major classes of receptors, type A
(GABA-A) and type B (GABA-B). GABA-B receptors are members
of the class of receptors coupled to G-proteins and mediate
a variety of inhibitory effects via secondary messenger
cascades. GABA-A receptors are ligand-gated chloride
channels that mediate rapid inhibition.

The GABA-A channel has 16 separate, but related,
genes encoding subunits. These are grouped on the basis of
sequence identity into alpha, beta, gamma, delta, epsilon,
theta and pi subunits. There are six alpha subunits
(α1-α6), three beta subunits (β1-β3) and three gamma
subunits (γ1-γ3). Each GABA-A receptor comprises five
subunits which may, at least in theory, be selected from
any of these subunits.

Neuronal nicotinic acetylcholine receptors (nAChRs)
consist of heterologous pentamers comprising various
combinations of alpha subunits or alpha and beta subunits
(α2-α9; β2-β4). The alpha subunits are characterised by
adjacent cysteine residues at amino acid positions 192 and
193, and the beta subunits by the lack of these cysteine
residues. They are ligand-gated ion channels
differentially expressed throughout the brain to form
physiologically and pharmacologically distinct receptors
hypothesised to mediate fast, excitatory transmission
between neurons of the central nervous system or to
modulate neurotransmission from their presynaptic
position.

In chicken and rat, the predominant nAChR subtype is
composed of alpha-4 and beta-2 subunits. The transmembrane
2 (M2) segments of the subunits are arranged as alpha
helices and contribute to the walls of the
neurotransmitter-gated ion channel. The alpha helices
appear to be kinked and orientated in such a way that the
side chains of the highly conserved M2-leucine residues
project inwards when the channel is closed. ACh is thought
to cause a conformational change by altering the
association of the amino acid residues of M2. The opening
of the channel seems to be due to rotations of the
gate-forming side chains of the amino acid residues; the
conserved polar serines and threonines may form the
critical gate in the open channel.

Example 1: Identification of mutations in ion channels

Previous studies (Wallace et al., 1998;
PCT/AU01/00581; Wallace et al., 2001b; Australian patent
AU-B-56247/96; Steinlein et al., 1995; PCT/AU01/00541;
Phillips et al., 2001; PCT/AU01/00729; PCT/AU01/01648;
PCT/AU02/00910; Wallace et al., 2001a, the disclosures of
which are incorporated herein by reference) have
identified mutations in a number of ion channel subunits
associated with epilepsy. These include ion channel
subunits of voltage-gated (eg SCN1A, SCN1B, KCNQ2, KCNQ3)
or ligand-gated (eg CHRNA4, CHRNB2, GABRG2, GABRD) types.
To identify further mutations in ion channel genes,
subunits which comprise the ion channels were screened for
molecular defects in epilepsy patients.

Human genomic sequence available from the Human
Genome Project was used to characterize the genomic
organisation for each subunit gene. Each gene was
subsequently screened for sequence changes using single
strand conformation polymorphism (SSCP) analysis in a
large sample of epileptics with common sporadic IGE
subtypes eg juvenile myoclonic epilepsy (JME), childhood
absence epilepsy (CAE), juvenile absence epilepsy (JAE)
and epilepsy with generalized tonic-clonic seizures (TCS).
Clinical observations can then be compared to the
molecular defects characterized in order to establish the
combinations of mutant subunits involved in the various
disease states, and therefore to provide validated drug
targets for each of these disease states. This will
provide a basis for novel drug treatments directed at the
genetic defects present in each patient.

The coding sequence for each of the ion channel
subunits was aligned with human genomic sequence present
in available databases at the National Centre for
Biotechnology Information (NCBI). The BLASTN algorithm was
typically used for sequence alignment and resulted in the
genomic organisation (intron-exon structure) of each gene
being determined. Where genomic sequence for an ion
channel subunit was not available, BACs or PACs containing
the relevant ion channel subunit were identified through
screening of high density filters containing these clones
and were subsequently sequenced.

Availability of the entire genomic sequence for each ion
channel subunit facilitated the design of intronic primers
spanning each exon. These primers were used for both high
throughput SSCP screening and direct DNA sequencing.




Example 2: Sample preparation for SSCP screening 

A large collection of individuals affected with
epilepsy have undergone careful clinical phenotyping and
additional data regarding their family history has been
collated. Informed consent was obtained from each
individual for blood collection and its use in subsequent
experimental procedures. Clinical phenotypes incorporated
classical IGE cases as well as GEFS+ and febrile seizure
cases.

DNA was extracted from collected blood using the
QIAamp DNA Blood Maxi kit (Qiagen) according to
manufacturer's specifications or through procedures adapted
from Wyman and White (1980). Stock DNA samples were kept
at a concentration of 1 µg/µl.

In preparation for SSCP analysis, samples to be
screened were formatted into 96-well plates at a
concentration of 30 ng/µl. These master plates were
subsequently used to prepare exon-specific PCR reactions
in the 96-well format.

Example 3: Identification of sequence alterations in ion
channel genes

SSCP analysis of specific ion channel exons followed
by sequencing of SSCP bandshifts was performed on
individuals constituting the 96-well plates to identify
sequence alterations.

Primers used for SSCP were labelled at their 5' end
with HEX and typical PCR reactions were performed in a
total volume of 10 µl. All PCR reactions contained 67 mM
Tris-HCl (pH 8.8); 16.5 mM (NH4)2SO4; 6.5 µM EDTA; 1.5 mM
MgCl2; 200 µM each dNTP; 10% DMSO; 0.17 mg/ml BSA; 10 mM
mercaptoethanol; 5 µg/ml each primer and 100 U/ml Taq DNA
polymerase. PCR reactions were performed using 10 cycles
of 94°C for 30 seconds, 60°C for 30 seconds, and 72°C for
30 seconds followed by 25 cycles of 94°C for 30 seconds,
55°C for 30 seconds, and 72°C for 30 seconds. A final
extension reaction for 10 minutes at 72°C followed.

Twenty µl of loading dye comprising 50% (v/v)
formamide, 12.5 mM EDTA and 0.02% (w/v) bromophenol blue
were added to completed reactions which were subsequently
run on non-denaturing 4% polyacrylamide gels with a
cross-linking ratio of 35:1 (acrylamide:bis-acrylamide) and
containing 2% glycerol. Gel thickness was 100 µm, width
168mm and length 160mm. Gels were run at 1200 volts and
approximately 20mA, at 22°C and analysed on the GelScan
2000 system (Corbett Research, Australia) according to
manufacturer's specifications.

PCR products showing a conformational change were
subsequently sequenced. This first involved
re-amplification of the amplicon from the relevant individual
(primers used in this instance did not contain 5' HEX
labels) followed by purification of the PCR amplified
templates for sequencing using QiaQuick PCR preps (Qiagen)
based on manufacturer's procedures. The primers used to
sequence the purified amplicons were identical to those
used for the initial amplification step. For each
sequencing reaction, 25 ng of primer and 100 ng of
purified PCR template were used. The BigDye sequencing kit
(ABI) was used for all sequencing reactions according to
the manufacturer's specifications. The products were run on
an ABI 377 Sequencer and analysed using the EditView
program.

Table 1 shows the novel sequence changes identified 
in the ion channel subunits screened. 

Example 4: Digenic model examples

In some instances a single mutation in an ion channel
alone is insufficient to give rise to an epilepsy
phenotype. However combinations of mutations each
conferring a subtle change of function to an ion channel,
as proposed by the digenic model (PCT/AU01/00872), may be
sufficient to produce an epilepsy phenotype.

Using the mutations and variations in ion channel
subunits that form part of this invention, the digenic
model may be validated through a parametric analysis of
large families in which two abnormal alleles co-segregate
by chance, to identify mutations which act co-operatively
to give an epilepsy phenotype. It is envisaged that the
strategy of careful clinical phenotyping in these large
families, together with a linkage analysis based on the
digenic hypothesis, will allow identification of the
mutations in ion channels associated with IGEs. If
molecular genetic studies in IGE are successful using the
digenic hypothesis, such an approach might serve as a
model for other disorders with complex inheritance.

The digenic hypothesis predicts that the closer the
genetic relationship between affected individuals, the
more similar the sub-syndromes, consistent with published
data (Italian League Against Epilepsy Genetic
Collaborative Group, 1993). This is because more distant
relatives are less likely to share the same combinations
of mutated subunits.

Identical twins have the same pair of mutated
subunits and the same minor alleles so the sub-syndromes
are identical. Affected sib-pairs, including dizygous
twins, with the same sub-syndrome would also have the same
pair of mutated subunits, but differences in minor alleles
would lead to less similarity than with monozygous twins.
Some sib-pairs and dizygous twins have quite different
sub-syndromes; this would be due to different combinations
of mutated subunits, when the parents have more than two
mutated alleles between them.

A special situation exists in inbred communities that
parallels observations on autosomal recessive mouse
models. Here the two mutated alleles of the digenic model
are the same and thus result in a true autosomal recessive
disorder. Because all affected individuals have the same
pair of mutated alleles, and a similar genetic background,
the phenotypes are very similar.

In outbred communities approximately 1% of the
population would have IGE genotypes (2 mutated alleles)
and 0.3% would clinically express IGE. Most of these would
have mutations in two different channel subunits. In such
communities most cases would appear "sporadic" as the risk
to first degree relatives would be less than 10%.

For example, let there be three IGE loci (A, B, C) and
let the frequency of abnormal alleles (a*, b*, c*) at each
locus be .027 and of normal alleles (a, b, c) be .973.
Then, the distribution of genotypes aa*, a*a, a*a* and aa
at locus A will be .0263 (.027 x .973), .0263, .0007 and
.9467 respectively, and similarly for loci B and C. In
this population .8485 will have no mutated alleles
(.9467³), .1413 will have one mutated allele (a* or b* or
c*; .0263 x .9467² x 6), .0098 will have two abnormal
alleles (.0020 two same abnormal alleles, .0078 two
different abnormal alleles) and 0.00037 will have more
than two abnormal alleles. Thus in this population .01, or
1%, will have two or more abnormal alleles (IGE genotype),
and the total abnormal allele frequency will be .08 (3 x
.027).
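
The arithmetic in the preceding paragraph can be checked with a short calculation. The following is only an illustrative sketch and not part of the original analysis: it assumes that the six alleles (two at each of three loci) are independent, so that the number of abnormal alleles carried by an individual is binomially distributed. The function and variable names are hypothetical.

```python
from math import comb

def abnormal_allele_distribution(q=0.027, loci=3):
    """Probability of carrying k abnormal alleles, for k = 0..2*loci.

    Assumes independent alleles and the same abnormal-allele
    frequency q at every locus (the three-locus example in the text).
    """
    n = 2 * loci  # two alleles per locus
    return [comb(n, k) * q**k * (1 - q) ** (n - k) for k in range(n + 1)]

dist = abnormal_allele_distribution()
p_none = dist[0]        # no mutated alleles
p_one = dist[1]         # exactly one mutated allele
p_two = dist[2]         # exactly two abnormal alleles
p_more = sum(dist[3:])  # more than two abnormal alleles

# Of the comb(6, 2) = 15 ways to place two abnormal alleles among six,
# 3 put both at the same locus and 12 put them at different loci.
p_two_same = p_two * 3 / comb(6, 2)
p_two_diff = p_two - p_two_same

print(f"none {p_none:.4f}, one {p_one:.4f}, two {p_two:.4f}, more {p_more:.5f}")
print(f"two same {p_two_same:.4f}, two different {p_two_diff:.4f}")
```

Under these assumptions the computed values agree with the figures quoted in the text to the precision given there.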

To determine the familial risks and allele patterns
in affected pairs, the frequency distribution of
population matings and the percentage of children with 2
or more abnormal alleles must be determined. The frequency
of matings with no abnormal alleles (0 x 0) is .72
(.8485²), for 1 x 0 and 0 x 1 matings .24 (2 x .8485 x
.1413), for a 1 x 1 mating .020, and for 2 x 0 and 0 x 2
matings .0166 etc. From this distribution of matings the
frequency of children with 2 or more abnormal alleles can
be shown to be .01. For example, the 0 x 2 and 2 x 0
matings contribute .0033 of this .01 frequency (.0166
[mating frequency] x .2 [chance of that mating producing a
child with 2 or more abnormal alleles]).
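
The mating frequencies quoted above follow from random mating between the genotype classes. The sketch below is illustrative only: it takes the class frequencies and the .2 per-child chance directly from the text, and the dictionary and function names are hypothetical.

```python
# Genotype-class frequencies from the text, keyed by the number of
# abnormal alleles carried by an individual.
freq = {0: 0.8485, 1: 0.1413, 2: 0.0098}

def mating_frequency(i, j):
    """Frequency of matings between classes i and j under random mating.

    Both orderings (i x j and j x i) are counted together, as in the text.
    """
    f = freq[i] * freq[j]
    return f if i == j else 2 * f

m00 = mating_frequency(0, 0)  # 0 x 0 matings
m10 = mating_frequency(1, 0)  # 1 x 0 and 0 x 1 matings
m11 = mating_frequency(1, 1)  # 1 x 1 matings
m20 = mating_frequency(2, 0)  # 2 x 0 and 0 x 2 matings

# Contribution of 2 x 0 and 0 x 2 matings to children with two or more
# abnormal alleles, using the .2 per-child chance stated in the text.
contribution_20 = m20 * 0.2

print(f"{m00:.2f} {m10:.2f} {m11:.3f} {m20:.4f} {contribution_20:.4f}")
```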

To determine parental risk it can be shown that of
children with 2 abnormal alleles (IGE genotype), .49
derive from 1 x 1 matings where no parent is affected, .33
derive from 2 x 0 and 0 x 2 matings etc. For the 2 x 0
and 0 x 2 matings, half the parents have IGE genotypes and
contribute .16 (.33/2) to the parental risk with the total
parental risk of an IGE genotype being .258. The other
matings that contribute to affected parent-child pairs are
2 x 1, 1 x 2, 3 x 0, 0 x 3 etc.

The sibling risk of an IGE genotype is .305. For
example 2 x 0 and 0 x 2 matings contributed .08 to the
sibling risk (.33 [fraction of children with 2 abnormal
alleles] x .25 [the chance of that mating producing a child
with 2 or more abnormal alleles]). Similarly the offspring
risk was determined to be .248 by mating individuals with
2 abnormal alleles with the general population. Thus at
30% penetrance the risk for IGE phenotype for parents of a
proband is .077, for siblings .091, and for offspring
.074.
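
The step from genotype risk to phenotype risk is a single multiplication by the penetrance. The following sketch is illustrative only; it uses the genotype risks and the 30% penetrance stated in the text, and the variable names are hypothetical.

```python
PENETRANCE = 0.30  # penetrance assumed in the text

# Risk that a relative of a proband carries an IGE genotype (from the text).
genotype_risk = {"parents": 0.258, "siblings": 0.305, "offspring": 0.248}

# Risk that the relative clinically expresses IGE.
phenotype_risk = {rel: PENETRANCE * r for rel, r in genotype_risk.items()}

# Contribution of 2 x 0 and 0 x 2 matings to the sibling genotype risk:
# fraction of affected children from those matings x per-child chance.
sibling_contribution_20 = 0.33 * 0.25

for rel, r in phenotype_risk.items():
    print(f"{rel}: {r:.4f}")
```

The products come to roughly .077, .091 and .074 when truncated to three decimal places, in line with the values given in the text.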

It can be shown that affected sib pairs share the
same abnormal allele pair in 85% of cases. This is because
of all affected sib pairs 44% derive from 1 x 1 matings
and 23% from 0 x 2 and 2 x 0 matings where all affected
siblings have the same genotype. In contrast, 24% derive
from 1 x 2 matings and S% from 3 x 1 and 2 x 2 matings etc
where affected sibling genotypes sometimes differ.

For affected parent-child pairs, genotypes are
identical in only 58%. Of affected parent-child pairs, 43%
derive from 0 x 2 matings where genotypes are identical,
whereas 38% derive from 0 x 3 and 17% from 1 x 2 where the
majority of crosses yield different affected genotypes.

Based on the digenic model it has been postulated
that most classical IGE and GEFS+ cases are due to the
combination of two mutations in multi-subunit ion
channels. These are typically point mutations resulting in
a subtle change of function. The critical postulate is
that two mutations, usually, but not exclusively, in
different subunit alleles ("digenic model"), are required
for clinical expression of IGE.

The hypothesis that similar phenotypes can be caused
by the combination of mutations in two (or more) different
subunits (outbred communities), or by the same mutation in
two (or more) alleles of the same subunit (inbred
communities), may seem implausible. However, applying the
digenic hypothesis to the theoretical pentameric channel
shown in Figure 1, in outbred communities IGE will be due
to subunit combinations such as α*αβ*βΔ, α*αββΔ* or ααβ*βΔ*
(mutated subunits indicated by *). In inbred communities
α*α*ββΔ or ααβ*β*Δ combinations might cause IGE phenotypes.
We assume that the mutations will not cause reduced
expression of the alleles and that the altered ion channel
excitability, and consequent IGE phenotype, caused by
mutations in two different alleles is similar to that
caused by the same mutation in both alleles of one
subunit. Finally, subunit mutations with more severe
functional consequences (eg breaking a disulphide bridge
in SCN1B or amino acid substitution in the pore-forming
regions of SCN1A for GEFS+) cause autosomal dominant
generalized epilepsies with a penetrance of 60-90%. Such
"severe" mutations are rare (allele frequency <0.01%) and
are infrequent causes of GEFS+. They very rarely, or
perhaps never, cause classical IGE.

The relative separate segregation of classical IGE
and GEFS+ phenotypes is an anecdotal clinical observation
of ours (Singh et al., 1999), although the separation is
not absolute. The separation is supported by previous
family and EEG studies of Doose and colleagues who
described "type A" and "type B" liabilities which may
approximate the GEFS+ and classical IGE groupings
respectively (Doose and Baier, 1987).

The digenic model predicts that affected sib pairs
will share the same genes in 85% of cases whereas they
will have at least one different allele in the remaining
15%. In contrast, only 58% of parent-child pairs share the
same alleles in a 3 locus model. Thus there should be
greater similarity of syndromes between sibling pairs than
parent-child pairs. This would be most objectively
measured by age of onset and seizure types.

Estimates for the risk of febrile seizures or IGE in
relatives vary. The estimates range from 5%-10% for
siblings, 4%-6% for offspring, 3%-6% for parents, and 2-3%
for grandparents. Underestimation may occur because IGE
manifest in youth, and parents and particularly
grandparents may be unaware of seizures in themselves in
younger years. This is particularly true where there was
stigma associated with epilepsy and where the epilepsy may
have been mild and unrecognized. Underestimation of
sibling and offspring risks occurs when unaffected young
children are counted, some of whom will develop IGE in
adolescence. Overestimation may occur with misdiagnosis of
seizures or inclusion of seizures unrelated to IGE (e.g.
due to trauma or tumors).

In autosomal dominant models the risk to relatives of
an affected individual reduces proportionally (50% for
first degree relatives, 25% for second degree etc). For all
oligogenic or polygenic models the risk decreases more
quickly. For a digenic model with three loci, the risks are
9.1% for siblings, 7.4% for offspring, 7.7% for parents.
Rigorous measurement of the familial recurrence rates, with
careful phenotyping and age-corrected risk estimates, could
be compared with the predictions from the digenic model,
and it is proposed to do this.

There is a small amount of information on IGE
families regarding haplotype distribution. For example,
there is some evidence for a locus on 8q as determined by
parametric linkage in a single family (Fong et al., 1998)
and by non-parametric analysis in multiple small families
(Zara et al., 1995). Interestingly, in the latter study
the 8q haplotype not infrequently came from the unaffected
parent. This would be quite compatible with the digenic
model and evaluation of other data sets in this manner
could be used to test the hypothesis, and it is proposed
to do this.

Following the analysis of one large family with
epilepsy where the two main phenotypes were childhood
absence epilepsy (CAE) and febrile seizures (FS), the
inheritance of FS was found to be autosomal dominant and
the penetrance 75%. However the inheritance of CAE in this
family was not simple Mendelian, but suggestive of complex
inheritance with the involvement of more than one gene.
The power of this large family was used to explore the
complex genetics of CAE further.

Linkage analysis on this family in which individuals
with CAE, FS and FS+ were deemed affected led to the
detection of linkage on chromosome 5q and identification
of a mutation in the GABRG2 gene (R43Q) which is localised
to this region (Wallace et al., 2001a; PCT/AU01/00729).
All 10 tested individuals with FS alone in this family had
this mutation and 7 CAE affected individuals in this
family also had the mutation. To test the digenic model of
IGEs in the CAE affected individuals, the whole genome
screen of this family was reanalysed with only individuals
with CAE considered affected. Linkage analysis was
performed using FASTLINK v4.0, two-point lod scores were
calculated assuming 50% penetrance and a 2% phenocopy rate
and individuals with FS or FS+ were coded as unknown.
Markers producing a lod score greater than 1 were
reanalysed without a phenocopy rate and at the observed
penetrance for CAE in this family (30%). Results from the
analysis revealed significant linkage to chromosome
14q22-q23 (lod 3.4). This provides strong evidence for a
second locus segregating with CAE affected individuals in
this family. While the GABRG2 mutation is sufficient to
cause FS, the CAE phenotype is thought to be due to both the
GABRG2 mutation and a mutation occurring in a gene mapping
to the 14q locus, as proposed by the digenic model.
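
A lod score is the base-10 logarithm of the likelihood ratio in favour of linkage, so the score reported above can be converted directly into odds. The short sketch below is only an illustration of that conversion; the helper name is hypothetical, and the threshold of 3 is the conventional value used in linkage studies rather than one stated in this document.

```python
def lod_to_odds(lod):
    """Convert a lod score (log10 likelihood ratio) into odds for linkage."""
    return 10 ** lod

odds_14q = lod_to_odds(3.4)        # lod score reported for 14q22-q23
odds_threshold = lod_to_odds(3.0)  # conventional threshold, i.e. 1000:1

print(f"lod 3.4 corresponds to odds of about {odds_14q:.0f}:1")
```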

For the application of the digenic model to sporadic
cases of IGE and affected individuals belonging to smaller
families in which genotyping and linkage analysis is not a
feasible approach to disease gene identification, direct
mutation analysis of ion channel genes in these
individuals has been carried out as described above. In
Table 1 there is provided an indication of novel genetic
alterations so far identified through mutation analysis
screening of these individuals. Figure 2 provides an
example to indicate where some of these mutations have
occurred with respect to the potassium channel KCNQ2 gene.

The identification of novel mutations and variations
in ion channel subunits in IGE individuals provides
resources to further test the digenic hypothesis and
mutation profiles are starting to accumulate for a number
of subunit changes that are observed in the same
individuals. Figure 3 provides results from some of these
profiles.

Figure 3A shows a 3 generation family in which
individual III-1 has myoclonic astatic epilepsy and
contains a N43del mutation in the SCN3A gene as well as an
A1067T mutation in the SCN1A gene. Individual I-1 also has
the SCN3A mutation but alone this mutation is not
sufficient to cause epilepsy in this individual. The SCN3A
mutation has likely been inherited from the grandfather
through the mother, while the SCN1A mutation is likely to
arise from the father. Both parents are unaffected but
have yet to be screened for the presence of the mutations
in these subunits. Individual II-1 is likely to contain an
as yet unidentified ion channel subunit mutation acting in
co-operation with the SCN3A mutation already identified in
this individual.

Figure 3B is another 3 generation family in which
individual III-1 has myoclonic astatic epilepsy due to a
combination of the same SCN3A and SCN1A mutations as
above. However, in this family both parents have febrile
seizures most likely due to the presence of just one of
the mutations in each parent, as proposed by the model.
This is in contrast to individuals II-2 and II-3 in Figure
3A who also contain one of the mutations in these genes
each. These individuals are phenotypically normal most
likely due to incomplete penetrance of these mutations in
each case.

Figure 3C shows a larger multi-generation family in
which individual IV-5 has a mutation in both the SCN3A and
GABRG2 subunits. In combination, these give rise to severe
myoclonic epilepsy of infancy but alone either cause
febrile seizures (GABRG2 mutation in III-3 and IV-4) or
are without an effect (SCN3A mutation in III-2) as
proposed by the model.

These examples therefore illustrate the digenic model
as determined from mutation analysis studies of ion
channel subunits in affected individuals and highlight the
need to identify genetic alterations in the genes encoding
ion channel subunits.

Example 5: Analysis of ion channels and ion channel
subunits

The structure and function of the mutant ion channels
and mutant ion channel subunits of the present invention
can be determined using a variety of molecular biological
studies. These studies may provide clues as to the
mechanisms by which mutations in ion channel subunits
affect the functioning of the ion channel. For instance
the identification of proteins that interact with mutant
ion channels (or whose interaction is impeded by a
mutation in an ion channel subunit) may help determine the
molecular mechanisms that are disrupted as a result of a
mutation. Procedures such as the yeast two-hybrid system
can be used to discover and identify such interacting
proteins.

The principle behind the yeast two-hybrid procedure
is that many eukaryotic transcriptional activators,
including those in yeast, consist of two discrete modular
domains. The first is a DNA-binding domain that binds to a
specific promoter sequence and the second is an activation
domain that directs the RNA polymerase II complex to
transcribe the gene downstream of the DNA binding site.
Both domains are required for transcriptional activation
as neither domain can activate transcription on its own.
In the yeast two-hybrid procedure, the gene of interest or
parts thereof (BAIT), is cloned in such a way that it is
expressed as a fusion to a peptide that has a DNA binding
domain. A second gene, or number of genes, such as those
from a cDNA library (TARGET), is cloned so that it is
expressed as a fusion to an activation domain. Interaction
of the protein of interest with its binding partner brings
the DNA-binding peptide together with the activation
domain and initiates transcription of the reporter genes.
The first reporter gene will select for yeast cells that
contain interacting proteins (this reporter is usually a
nutritional gene required for growth on selective media).
The second reporter is used for confirmation and while
being expressed in response to interacting proteins it is
usually not required for growth.
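
The selection logic described above can be summarised as a small truth table. The sketch below is a deliberately simplified model and not part of the described procedure: the class and function names are hypothetical, and it ignores plasmid selection markers and self-activation of the reporters by the BAIT alone.

```python
from dataclasses import dataclass

@dataclass
class YeastCell:
    has_bait_fusion: bool    # DNA-binding domain fused to the BAIT
    has_target_fusion: bool  # activation domain fused to the TARGET
    proteins_interact: bool  # BAIT and TARGET proteins bind each other

def reporters_active(cell):
    """Reporters are transcribed only when both fusions are present and
    interact, bringing the DNA-binding and activation domains together."""
    return (cell.has_bait_fusion
            and cell.has_target_fusion
            and cell.proteins_interact)

def grows_on_selective_medium(cell):
    """First reporter: a nutritional gene required for growth."""
    return reporters_active(cell)

def confirmation_reporter_on(cell):
    """Second reporter: expressed on interaction, not needed for growth."""
    return reporters_active(cell)

interacting = YeastCell(True, True, True)
non_interacting = YeastCell(True, True, False)
print(grows_on_selective_medium(interacting),
      grows_on_selective_medium(non_interacting))
```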

KCNQ2 interactors

Despite the identification of a number of KCNQ2
mutations responsible for epilepsy, including those of the
present study, the underlying biological mechanisms
responsible for the epilepsy remain largely
uncharacterized. Towards identifying these mechanisms, the
large intracellular C-terminal region of KCNQ2 was
screened for interactions with other proteins using the
yeast two-hybrid procedure. The C-terminus accounts for
63% of the KCNQ2 protein and, in common with other KCNQ
subunits, contains a conserved 'A domain' (Jentsch, 2000;
Schwake et al., 2000) thought to be involved in subunit
interactions as well as another distal short conserved
region that has been associated with subunit assembly, at
least in KCNQ1 (Jentsch, 2000; Schmitt et al., 2000).

A) Yeast two-hybrid analysis

A yeast two-hybrid screen was carried out using the
ProQuest™ Two-Hybrid System with Gateway™ Technology
(Invitrogen™) according to manufacturer's directions. A
KCNQ2 C-terminal entry (BAIT) clone was generated using
the pENTR Directional TOPO® Cloning Kit (Invitrogen™). The
following primers were designed to amplify the
intracellular C-terminal region of KCNQ2 based on the
sequence of human KCNQ2 (Genbank accession number
NM_172107): KCNQ2F: 5'-CACCAAGGTTCAGGAGCAGCACAGG-3' and
KCNQ2R: 5'-TCACTTCCTGGGCCCGGCCCAGCC-3'. The 1611 base pair
cloned fragment included exon 10a (found in all our
amplified clones), corresponding to amino acids 373-382 of
the KCNQ2 protein. The extra 30 base pairs (10 amino
acids) were included in our numbering. The PCR product was
cloned into the pENTR/D-TOPO® vector (Invitrogen™) via the
TOPO® Cloning reaction according to the manufacturer's
instructions. Following sequence verification, the KCNQ2
cDNA fragment was then subcloned into pDEST™32, the DNA
Binding domain (DB) Gateway™ Destination Vector
(Invitrogen™).
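
A few properties of the primers and of exon 10a quoted above can be checked with simple string operations. The sketch below is illustrative only and the helper names are hypothetical. It assumes that the leading CACC on the forward primer is the 5' sequence used for directional TOPO cloning, and it notes that the reverse primer begins with TCA, which is the reverse complement of a TGA stop codon.

```python
KCNQ2F = "CACCAAGGTTCAGGAGCAGCACAGG"  # forward primer from the text
KCNQ2R = "TCACTTCCTGGGCCCGGCCCAGCC"   # reverse primer from the text

def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Assumed directional-cloning overhang at the 5' end of the forward primer.
starts_with_cacc = KCNQ2F.startswith("CACC")

# On the coding strand the reverse primer reads as its reverse complement.
coding_strand_end = reverse_complement(KCNQ2R)

# Exon 10a: amino acids 373-382 inclusive, three bases per residue.
exon_10a_residues = 382 - 373 + 1
exon_10a_bp = exon_10a_residues * 3

print(len(KCNQ2F), len(KCNQ2R), starts_with_cacc)
print(gc_fraction(KCNQ2F), gc_fraction(KCNQ2R))
print(coding_strand_end, exon_10a_residues, exon_10a_bp)
```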

The ProQuest™ Two-Hybrid human brain cDNA Library
(TARGET) with Gateway™ technology (ResGen™, Invitrogen™
Corporation) was amplified according to the manufacturer's
instructions. Plasmid DNA was purified from the cell
pellet using the HiSpeed Plasmid Maxi Kit (Qiagen)
according to the manufacturer's instructions.

Both the DBLeu (empty bait vector) and DB-KCNQ2
wild-type (wt) C-term BAITs were transformed into the yeast
strain MaV203 and plated onto minimal selective media
lacking leucine. A duplicate was carried out where the
empty library TARGET (pAD) vector was co-transformed in
addition to each BAIT and plated onto minimal selective
media lacking leucine (-leu) and tryptophan (-tryp). Yeast
control strains (Invitrogen™) were included on all plates.
Control 1, used as a negative control, contained empty
plasmids pPC97 and pPC86. Control 2 had pPC97-RB and
pPC86-E2F1, which express a relatively weak interaction.
Control 3 contained plasmids encoding the Drosophila DP
(pPC97) and E2F (pPC86) domains that have a moderately
strong interaction, and provide a control for plasmid
shuffling. Control 4 contained pPC97-Fos and pPC86-Jun
which express a relatively strong interaction, and control
5 had a pCL1 plasmid encoding full-length GAL4p and empty
pPC86 and was used as a positive control.

The constructs were tested for self-activation of the
his and β-gal reporter genes according to Invitrogen™
instructions.

For the yeast two-hybrid screen, competent yeast
cells were prepared for each BAIT (DB-KCNQ2 wt C-term
construct) to be screened, transformed with 3 µg of
ProQuest™ Two-Hybrid human brain AD (activation domain)-
cDNA Library and plated onto minimal selective media
lacking leucine (-leu), tryptophan (-tryp) and histidine
(-his) and containing 3-aminotriazole (+3AT). Positive
colonies from each screen were PCR-amplified and
re-introduced into fresh yeast cells containing the BAIT to
re-test for two-hybrid interaction phenotypes. Those
giving rise to more than one PCR product or that failed to
re-test positively were systematically eliminated.
Positives that re-tested were sequenced using the ABI
PRISM® BigDye™ Terminators v3.0 technology. Once
identified, the sequence of the potential interactor was
checked to verify it was in the same translational frame
as the Gal4p-AD encoding sequence of the prey construct.
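
The frame check described above reduces to modular arithmetic on nucleotide coordinates. The following is a minimal Python sketch of that idea only; the function names and the example coordinates are hypothetical and are not taken from the prey constructs described here.

```python
def frame_offset(ad_codon_start: int, insert_codon_start: int) -> int:
    """Return the frame offset (0, 1 or 2) between two codon starts.

    Both arguments are 1-based nucleotide positions in the sequenced prey
    construct: the first base of any Gal4p-AD codon and the first base of
    the codon at which the cDNA insert is expected to be read.
    """
    return (insert_codon_start - ad_codon_start) % 3


def is_in_frame(ad_codon_start: int, insert_codon_start: int) -> bool:
    """True when the insert is read in the same frame as the AD sequence."""
    return frame_offset(ad_codon_start, insert_codon_start) == 0


# Hypothetical coordinates, for illustration only.
print(is_in_frame(1, 445))  # 444 bases apart, a multiple of 3
print(is_in_frame(1, 446))  # 445 bases apart, shifted by one base
```

An offset of 1 or 2 would mean the candidate interactor is translated out of frame with the activation domain, so the recovered clone would not encode the expected fusion protein.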

Approximately 3 x 10^6 clones from the ProQuest™ Two-
Hybrid human brain cDNA library were screened for
interaction with the DB-Q2C wt bait. Among 1039 positive
AD-cDNAs recovered, re-tested and subsequently sequenced,
all were identified as the CALM2 gene, encoding the
ubiquitous Ca2+-binding protein, Calmodulin (CaM).

The interaction between the C-terminal region of
KCNQ2 and CaM has also been reported by other studies (Wen
and Levitan, 2002; Yus-Najera et al., 2002; Gamper and
Shapiro, 2003). In mammals, the CaM protein is coded by a
multigene family consisting of three bona fide members,
CALM1, CALM2 and CALM3. Within the non-coding regions of
the CaM transcripts, no striking homology is observed, and
codon usage is maximally divergent amongst the three CaM
mRNAs that encode an identical protein. It has been
hypothesised that the existence of a multigene family
provides a tight and complex level of regulatory control
at the level of gene expression (Palfi et al., 2002). CaM
genes are differentially expressed in the CNS during
development and differential regulation of the CaM genes
appears necessary to maintain the temporal and spatial
fidelity of the CaM protein levels in all subcellular
domains. Besides the fundamental housekeeping functions
associated with CaM, it is also involved in specialized
neuronal functions, such as the synthesis and release of
neurotransmitters, neurite extension, long-term
potentiation and axonal transport (Palfi et al., 2002).

B) Effect of epilepsy-associated KCNQ2 mutations on the
CaM-KCNQ2 interaction

To assess the effect that the C-terminus mutations of
the present invention had on CaM binding, two of the
identified mutations (R353G and L619R) were introduced
into the DB-Q2C construct by mutagenesis and were
re-analysed for an interaction with CaM using the yeast
two-hybrid procedure.

The following primers were used to incorporate the
c1057C→G (R353G) and c1856T→G (L619R) changes into the
pDEST™32-KCNQ2 C-terminal bait construct.

R353G F 5'-CGCCACCAACCTCTCGGGCACAGACCTGCACTC-3'

R353G R 5'-GAGTGCAGGTCTGTGCCCGAGAGGTTGGTGGCG-3'

L619R F 5'-CTTGTCCATGGAGAAGAAGCGGGACTTCCTGGTGAATATC-3'

L619R R 5'-GATATTCACCAGGAAGTCCCGCTTCTTCTCCATGGACAAG-3'
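
Each mutagenesis primer pair listed above should consist of two exact reverse complements, since both primers anneal to the same mutated site on opposite strands. The short Python sketch below checks that property for the four primer sequences given in the text; the helper names (`reverse_complement`, `primers_are_complementary`) are illustrative and not part of any described protocol.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Forward and reverse mutagenesis primers, written 5' to 3' as in the text.
PRIMERS = {
    "R353G": ("CGCCACCAACCTCTCGGGCACAGACCTGCACTC",
              "GAGTGCAGGTCTGTGCCCGAGAGGTTGGTGGCG"),
    "L619R": ("CTTGTCCATGGAGAAGAAGCGGGACTTCCTGGTGAATATC",
              "GATATTCACCAGGAAGTCCCGCTTCTTCTCCATGGACAAG"),
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primers_are_complementary(forward: str, reverse: str) -> bool:
    """True when the reverse primer is the reverse complement of the forward."""
    return reverse_complement(forward) == reverse


for name, (fwd, rev) in PRIMERS.items():
    print(name, primers_are_complementary(fwd, rev))
```

A check of this kind is also a convenient way to catch transcription errors in primer sequences before ordering oligonucleotides.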

Overlapping PCR products were generated using the
TOPO® cloning compatible KCNQ2F primer from the initial
cloning and the mutagenesis reverse primers, and the
KCNQ2R primer from the initial cloning with the
mutagenesis forward primers. Products were gel extracted
and purified before a second round of PCR using the
initial KCNQ2 F&R primers. These products were also gel
extracted before cloning into the pDEST™32 bait vector via
the TOPO system (as described above). Mutant baits were
sequence verified.

The interaction between each DB-Q2C mutant and CaM
was then tested by the yeast two-hybrid assay and compared
to the interaction with DB-Q2C wt. Three different PCR-
amplified CaM positive clones from the initial screen were
re-introduced by gap-repair into the prey vector (pPC86)
in the yeast strain expressing either DB-Q2C wt, DB-Q2C
mutants or the empty DBLeu vector, used as negative
control.

CaM interaction with the DB-Q2C wt and mutants was
then assessed by expression of the HIS3 and LacZ reporter
genes.

The Q2C R353G mutant did not interact with CaM, as
seen by no growth on HIS3 selective plate (Figure 4C) and
no blue readout in the LacZ filter assay (Figure 4D-F). On
the other hand, the DB-Q2C L619R mutant was shown to still
interact with CaM, as seen by growth on HIS3 selective
plate (Figure 4C) and the blue readout in the LacZ filter
assay. Interestingly, the DB-Q2C L619R mutant showed an
even greater growth level on HIS3 selective plate than the
DB-Q2C wt and also appeared to stain faster and more
intensely blue in the LacZ filter assay, suggesting a
stronger interaction between CaM and this mutant.

In order to better quantify β-gal activity, a second
assay was carried out using the high sensitivity substrate
Chlorophenol Red-β-D-Galactopyranoside (CPRG) in liquid
culture. The affinity of the DB-Q2C/AD-CaM interaction was
measured in terms of units of β-gal activity, with a zero
value indicating no expression of the LacZ reporter gene,
and hence no interaction.

In the CPRG assay, the DB-Q2C wt gave a value of 0.05
units β-gal activity (Figure 5), which was significantly
different from the empty bait vector replicate (P<0.01,
Student's t test), confirming the interaction of the
DB-Q2C wt with CaM.

As observed in the LacZ filter assay, the CPRG assay
showed a significant difference in the interaction between
the Q2C R353G mutant and CaM as compared to the wt
replicate (P<0.01, Student's t test, Figure 4).

These results suggest that the R353G mutation alters
the structural conformation of the KCNQ2 C-terminal domain
such that it is no longer able to bind to CaM and that
this single point mutation is sufficient to abolish the
interaction. By abolishing CaM binding, the R353G mutation
could lead to an impairment of M-current in vivo due to
decreased opening of the channel.

In contrast, the CPRG assay for the L619R Q2C mutant
showed a significantly higher level of β-gal activity units
(0.26 units) than the wt replicate (P<0.001, Student's t
test, Figure 5). This finding indicates that the L619R
mutation alters the conformation of the protein in a
manner that increases CaM binding affinity for the KCNQ2
C-terminal domain by approximately 5-fold. The increased
affinity for CaM may affect the ability of the complex to
change conformation normally in response to calcium
signalling. Alternatively, the marked increase in binding
of CaM to the KCNQ2 L619R mutant channel may be
detrimental to the M-channel function via disruption of
the normal neuronal inhibitory/excitatory balance,
therefore causing the seizures associated with epilepsy,
particularly BFNS. CaM is known to be involved in both the
excitatory and inhibitory neurotransmission pathways (Ohya
and Botstein, 1994) and it has been proposed that the
temporal and spatial restrictions on CaM itself could
enable the tight control of these opposing reactions
(Toutenhoofd and Strehler, 2000). Hence, the KCNQ2 L619R
mutation could lead to a disruption of the local CaM pool
consequently disturbing the finely balanced excitatory and
inhibitory neurotransmission systems.
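
The "approximately 5-fold" figure follows directly from the two CPRG readings quoted in the text (0.05 units for the wt bait and 0.26 units for the L619R bait). The small Python sketch below reproduces that arithmetic; the function name is illustrative, and it assumes the reported unit values are directly comparable between baits.

```python
def fold_change(mutant_units: float, wt_units: float) -> float:
    """Ratio of mutant to wild-type reporter activity."""
    if wt_units <= 0:
        raise ValueError("wild-type activity must be positive")
    return mutant_units / wt_units


WT_UNITS = 0.05      # DB-Q2C wt, units of beta-gal activity (from the text)
L619R_UNITS = 0.26   # DB-Q2C L619R, units of beta-gal activity (from the text)

ratio = fold_change(L619R_UNITS, WT_UNITS)
print(f"L619R / wt = {ratio:.1f}-fold")
```

The ratio only compares reporter activity in this assay; it is a proxy for relative interaction strength rather than a direct measurement of binding affinity.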

These results implicate CaM in the pathogenesis of
epilepsy and specifically in the BFNS syndrome. Whilst
further work will be required to fully elucidate the
involvement of the KCNQ2-CaM interaction in neuronal
excitability and its correlation with idiopathic epilepsy,
these data suggest that dysfunction of this interaction
leads to aberrant neuronal excitability in some BFNS
patients.

The calmodulin gene (and other ion channel
interacting genes) may therefore be a target for mutation
in epilepsy as well as other disorders associated with ion
channel dysfunction. A mutation in an ion channel
interacting gene when expressed alone, or when expressed
in combination with one or more other ion channel
mutations or ion channel interacting gene mutations (based
on the digenic model), may give rise to the disorder. The
nature of the ion channel interacting genes and proteins
can be studied such that these partners can also be
targets for drug discovery.
Industrial Applicability

The mutant ion channel receptor subunits of the
invention are useful in the diagnosis and treatment of
diseases such as epilepsy and disorders associated with
ion channel dysfunction including, but not limited to,
hyper- or hypo-kalemic periodic paralysis, myotonias,
malignant hyperthermia, myasthenia, cardiac arrhythmias,
episodic ataxia, migraine, Alzheimer's disease,
Parkinson's disease, schizophrenia, hyperekplexia,
anxiety, depression, phobic obsessive symptoms,
neuropathic pain, inflammatory pain, chronic/acute pain,
Bartter's syndrome, polycystic kidney disease, Dent's
disease, hyperinsulinemic hypoglycemia of infancy, cystic
fibrosis, congenital stationary night blindness and total
colour-blindness.
Examples of mutations and variations identified in ion channel
subunit genes

Subunit Gene  Exon/Intron  DNA Mutation    Amino Acid Change  SEQ ID NOS

Sodium Channel Subunits

Coding exonic variants - amino acid change
SCN1B r       Exon 3       c254G→A         R85H               1, 63
SCN2A r       Exon 6A      c668G→A         R223Q              2, 64
SCN2A r       Exon 16      c2674G→A        V892I              3, 65
SCN2A r       Exon 17      c3007C→A        L1003I             4, 66
SCN2A r       Exon 19      c3598A→G        T1200A             5, 67
SCN2A r       Exon 20      c3956G→A        R1319Q             6, 68

Coding exonic variants - no amino acid change
SCN2A c       Exon 12      c1785T→C        -                  7
SCN2A c       Exon 27      c4919T→A        -                  8

Non-coding variants
SCN1A c       Intron 23    IVS23+33G→A     -                  9
SCN2A'        Intron 7     IVS7+61T→A      -                  10
SCN2A r       Intron 19    IVS19-55A→G     -                  11
SCN2A r       Intron 22    IVS22-31A→G     -                  12
SCN2A c       Intron 2     IVS2-28G→A      -                  13
SCN2A e       Intron 8     IVS8-3T→C       -                  14
SCN2A ::      Intron 11    IVS11+49A→G     -                  15
SCN2A c       Intron 11    IVS11-16C→T     -                  16
SCN2A c       Intron 17    IVS17-71C→T     -                  17
SCN2A c       Intron 17    IVS17-74delG    -                  18
SCN2A c       Intron 17    IVS17-74insG    -                  19

Nicotinic Acetylcholine Receptor Subunits

Coding exonic variants - amino acid change
CHRNA5 r      Exon 4       c400G→A         V134I              20, 69
CHRNA2 c      Exon 4       c373G→A         A125T              21, 70
CHRNA3 c      Exon 2       c110G→A         R37H               22, 71

Coding variants - no amino acid change
CHRNA2"       Exon 4       c351C→T         -                  23
CHRNA2 c      Exon 5       c771C→T         -                  24
CHRNA3 c      Exon 2       c159A→G         -                  25
CHRNA3 c      Exon 4       c291G→A         -                  26
CHRNA3 c      Exon 4       c345G→A         -                  27

Non-coding variants
CHRNA2 c      Intron 3     IVS3-16C→T      -                  28
CHRNA3 c      Intron 3     IVS3-5T→C       -                  29
CHRNA3 c      Intron 4     IVS4+8G→C       -                  30
Potassium Channel Subunits

Coding exonic variants - amino acid change
KCNQ2 r       Exon 1       c204-c205insC   K69fsX119          31, 72
KCNQ2 r       Exon 1       c1A→G           M1V                32
KCNQ2 r       Exon 1       c2T→C           M1T                33
KCNQ2 r       Exon 8       c1057C→G        R353G              34, 73
KCNQ2 r       Exon 11      c1288C→T        R430X              35, 74
KCNQ2 r       Exon 14      c1710A→T        R570S              36, 75
KCNQ2 r       Exon 15      c1856T→G        L619R              37, 76

Non-coding variants
KCNQ2 r       Intron 9     IVS9+(46-48)delCCT  -              38
KCNQ3 r       Intron 11    IVS11+43G→A     -                  39
KCNQ3 c       Intron 12    IVS12+29G→A     -                  40

GABA Receptor Subunits

Coding exonic variants - no amino acid change
GABRB1 r      Exon 5       c508C→T         -                  41
GABRB1 r      Exon 9       c1329G→A        -                  42
GABRB1 c      Exon 8       c975C→T         -                  43
GABRG3 c      Exon 8       c995T→C         -                  44

Non-coding variants
GABRA1 c      5' UTR       c-142A→G        -                  45
GABRA1 c      5' UTR       c-31C→T         -                  46
GABRA2 c      3' UTR       c1615G→A        -                  47
GABRA5 c      5' UTR       c-271G→C        -                  48
GABRA5 c      5' UTR       c-228A→G        -                  49
GABRA5 c      5' UTR       c-149G→C        -                  50
GABRB2 b      5' UTR       c-159C→T        -                  51
GABRB2 c      3' UTR       c1749C→T        -                  52
GABRPi c      5' UTR       c-101C→T        -                  53
GABRB1 c      Intron 1     IVS1+24T→G      -                  54
GABRB1 c      Intron 5     IVS6+72T→G      -                  55
GABRB1 2      Intron 7     IVS7-34A→G      -                  56
GABRB3 c      Intron 1     IVS1-14C→T      -                  57
GABRB3 r      Intron 7     IVS7+58delAA    -                  58
GABRD r       Intron 6     IVS6+132insC    -                  59
GABRD r       Intron 6     IVS6+130insC    -                  60
GABRD r       Intron 6     IVS6+73delCGCGCCCACCGCCCCTTCCGCG  -  61
GABRG3 c      Intron 8     IVS8-102C→T     -                  62

Note: r Mutations or variations only occurring in individuals with epilepsy; e
Variant seen only in normal control samples; c Mutations or variants seen in
individuals with epilepsy as well as normal control samples. The KCNQ2
numbering is based on the large isoform (inclusion of exon 10a). The numbering
of exons and introns for SCN2A is based on the publication of Kasai et al.,
2001.
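
The DNA and amino acid columns of the table are linked by simple codon arithmetic: coding position c.N falls in codon (N - 1) // 3 + 1. The Python sketch below applies that rule to the missense and nonsense entries in the table as a consistency check. It assumes that positions are numbered from the A of the initiating ATG and that residues are numbered from the initiating methionine; the function and variable names are illustrative.

```python
def codon_of(c_pos: int) -> tuple:
    """Map a 1-based coding position to (codon number, position in codon)."""
    if c_pos < 1:
        raise ValueError("coding positions are 1-based")
    return (c_pos - 1) // 3 + 1, (c_pos - 1) % 3 + 1


# Coding position -> residue number, as listed in the table.
TABLE_ENTRIES = {
    254: 85, 668: 223, 2674: 892, 3007: 1003, 3598: 1200, 3956: 1319,  # SCN1B / SCN2A
    400: 134, 373: 125, 110: 37,                                      # CHRNA5 / CHRNA2 / CHRNA3
    1057: 353, 1288: 430, 1710: 570, 1856: 619,                       # KCNQ2
}

for c_pos, residue in TABLE_ENTRIES.items():
    codon, offset = codon_of(c_pos)
    status = "consistent" if codon == residue else "MISMATCH"
    print(f"c{c_pos}: codon {codon}, base {offset} of 3 -> {status}")
```

For example, c1057 is the first base of codon 353 and c1856 is the second base of codon 619, which agrees with the R353G and L619R designations used for the KCNQ2 baits.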
Abstract

A method of identifying a subject predisposed to a 
disorder associated with ion channel dysfunction, 
comprising ascertaining whether at least one of the genes 
encoding ion channel subunits in said subject has 
undergone a mutation event as set forth in one of SEQ ID 
Numbers: 1-62. 
SequencesSSCPre-file7August03.ST25.txt

SEQUENCE LISTING 

<110> Bionomics Limited 
<120> P10 

<130> SSCP Prov Update 2003 
<160> 76 

<170> PatentIn version 3.2

<210> 1 

<211> 1414 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gctcccgggg acattctaac cgccgccagg tcccgccgcc tctcgccccg ctattaatac 
60 

cggcggcccg ggaggggggc gcagcacgcg ccgcgcagcc atggggaggc tgctggcctt 
120 

agtggtcggc gcggcactgg tgtcctcagc ctgcgggggc tgcgtggagg tggactcgga 
180 

gaccgaggcc gtgtatggga tgaccttcaa aattctttgc atctcctgca agcgccgcag 
240 

cgagaccaac gctgagacct tcaccgagtg gaccttccgc cagaagggca ctgaggagtt
300 

tgtcaagatc ctgcgctatg agaatgaggt gttgcagctg gaggaggatg agcacttcga 
360 

gggccgcgtg gtgtggaatg gcagccgggg caccaaagac ctgcaggatc tgtctatctt 
420 

catcaccaat gtcacctaca accactcggg cgactacgag tgccacgtct accgcctgct 
480 

cttcttcgaa aactacgagc acaacaccag cgtcgtcaa^ aagatccaca ttgaggtagt 
540 

ggacaaagcc aacagagaca tggcatccat cgtgtctgag atcatgatgt atgtgctcat 
600 

tgtggtgttg accatatggc tcgtggcaga gatgatttac tgctacaaga agatcgctgc 
660 

cgccacggag actgctgcac aggagaatgc ctcggaatac ctggccatca cctctgaaag 
720 

caaagagaac tgcacgggcg tccaggtggc cgaatagccc tggccctggg ccccgcctca
780

aggaagagcc agccgtaatg gggactctcc aggcaccgcc tgcccccagc gtgggggtgg 
840 

ccactcctgg gccccagaaa gcctcagagt cctgccgacg gagccactgg ggtgggaggg
900 

ggcagggggc ttggctcgca cccccacttt cgcctcctcc agctcctgcc ccgccggccg 
960 

cgcaccgcca tgcatgatgg gtaaagcaat actgccgctg cccccaccct gcttctgctg 
1020 

cctgtttggg gaggggggcg gtgaggtggg ggcagcggcc ccgcacccct cctccttgct 
1080 

gatttgcaca cattggccgc ttcagacacg cacttctggg gccagcccct ccccgcctcc 
1140 

tccctgcctg gcggcagggg tcgcgatgat gggctggagc agtttggggc agggggttct 
1200 

gggacccact ccgactcccc ctccccggca tcatttcccc tcccgcttcc tccggctgga 
1260 

cctggggtcc cccctccctg taatgcactc ctgccccggc ccaacctcgc cctctctcac 
1320 

cagccttgaa ctgtggccac ctagaaaggg gcccattcag cctcgtctct ttacagaagt 
1380 

agttttgttc atgaaataaa gactcttgga cttg 
1414 

<210> 2 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 2 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 

cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 

ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 

gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 

gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300

tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat
360

atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 

acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt 
480 

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt 
540 

atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt 
600 

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca 
660 

tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca 
720 

gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccaagcattg 
780 

aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg 
840 

aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata 
900 

ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat 
960 

tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact 
1020 

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac 
1080 

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc 
1140 

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg 
1200 

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc 
1260 

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt 
1320 

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc 
1380 
atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa
1440 

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca 
1500 



gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620 

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa 
1680 

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg 
1740 

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc 
1800 

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag 
1860 

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac 
1920 

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc 
1980 

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat 
2040 

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct 
2100 

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc 
2160 

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg 
2220 

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc 
2280 

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg
2340

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc
2400 

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag 
2460 
cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa
2520 

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt 
2580 



tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga 
2640 



ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca 
2700 



actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc 
2760 



ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag 
2820 



agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg 
2880 

catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag 
2940 

accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg 
3000 

gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc 
3060 

ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt 
3120 

gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt 
3180 

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta 
3240 

aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc 
3300 

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat 
3360 

gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta 
3420 

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag 
3480 

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg 
3540 

gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt
3600

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc 
3660 



atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg 
3720 



gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg 
3780 

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct 
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt 
3900 

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca 
3960 

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc
4020 

agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt 
4080 

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg 
4140 

atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200 

tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag 
4260 

tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt 
4320 

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg 
4380 

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac 
4440 

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg 
4500 

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt 
4560 

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt 
4620 

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt 
4680 
gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg
4740 



gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg 
4800 



attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt 
4860 



cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc 
4920 



attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc 
4980 

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg 
5040 

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc 
5100

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt 
5160 

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc 
5220 

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat 
5280 

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac 
5340 

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa 
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt 
5700 

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca 
5760 
tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag
5820



gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 



aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc 
5940



aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 
6000 



acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 



tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120 

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 



gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat 
6240 



gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300 



agtaaactgg agaaatagta tcgatggg 
6328 



<210> 3 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 

cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 

ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 



gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 

gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300 

tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 
atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc
420

acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt
480

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt
540

atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt
600

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca
660

tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca
720

gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg
780

aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg
840

aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata
900

ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat
960

tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact
1020

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac
1080

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc
1140

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg
1200

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc
1260

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt
1320

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc
1380

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa
1440

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca
1500

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620 

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa 
1680 

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg 
1740 

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc 
1800 

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag 
1860 

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac 
1920 

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc 
1980 

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat 
2040 

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct 
2100 

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc 
2160 

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg 
2220 

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc
2280

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg 
2340 

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc 
2400 

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag 
2460 

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa 
2520 

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt 
2580 
tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga
2640

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca
2700

actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc
2760

ttggtattgg ccatcatcat cttcattttt gctgtggtcg gcatgcagct ctttggtaag
2820

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg
2880

catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag
2940

accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg
3000

gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc
3060

ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt
3120

gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt
3180

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta
3240

aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc
3300

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat
3360

gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta
3420

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag
3480

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg
3540

gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt
3600

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc
3660

atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg
3720 

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg 
3780 



gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct 
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt 
3900 

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca 
3960 

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc 
4020 

agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt 
4080 

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg 
4140 

atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200 

tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag 
4260 

tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt 
4320 

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg 
4380 

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac 
4440 

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg 
4500 

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt 
4560 

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt 
4620 

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt 
4680 

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg 
4740 

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg 
4800

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt 
4860 

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc 
4920 

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc 
4980 

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg 
5040 

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc 
5100 

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt 
5160 

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc
5220 

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat 
5280 

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac 
5340 

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa 
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt
5700 

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca 
5760 

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag 
5820 

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 
aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc
5940 

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 

6000

acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 

tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 

gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat 
6240

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300

agtaaactgg agaaatagta tcgatggg 
6328 



<210> 4 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 

cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 

ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 

gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 

gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300 

tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 

atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 

acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt
480

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt
540 



atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt 
600 

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca
660 



tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca 
720 



gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg 
780 



aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg 
840 



aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata 
900 



ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat 
960 



tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact 
1020

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac 
1080 

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc 
1140 



cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg 
1200 



agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc 
1260 

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt 
1320 

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc 
1380 

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa 
1440 

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca
1500 

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 



agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620 

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa 
1680 

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg 
1740 

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc 
1800 

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag 
1860

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac 
1920 

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc 
1980 

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat 
2040 

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct 
2100 

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc 
2160 

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg 
2220 

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc 
2280 

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg 
2340 

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc 
2400 

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag 
2460 

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa 
2520 

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt 
2580 

tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga 
2640 

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca
2700



actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc 
2760 



ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag 
2820 

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg 
2880 



catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag 
2940 



accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg 
3000



gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc 
3060 



ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tatccagatt 
3120 



gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt 
3180 



cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta 
3240 



aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc 
3300 



aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat 
3360 



gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta 
3420 



ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag 
3480 

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg 
3540 



gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt 
3600 

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc 
3660 

atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg 
3720 

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg
3780

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt 
3900 

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca
3960 

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc 
4020 

agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt
4080 

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg 
4140 

atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200 

tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag 
4260 

tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt 
4320 

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg 
4380 

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac 
4440 

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg 
4500

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt
4560 

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt 
4620 

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt 
4680 

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg
4740 

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg 
4800 

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt
4860

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc
4920

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc
4980

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg
5040

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc
5100

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt
5160

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc
5220

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat
5280

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac
5340

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg
5400

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa
5460

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt
5520

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg
5580

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc
5640

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt
5700

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca
5760

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag
5820

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt
5880

aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc
5940

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg
6000



acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 



tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120 



taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 



gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat 
6240 

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300 



agtaaactgg agaaatagta tcgatggg 
6328



<210> 5 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 



cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 



ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 

gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 

gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt
300 

tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 

atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 

acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt 
480 

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt 
540 

atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt
600

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca 
660 

tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca 
720

gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg 
780 

aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg
840

aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata
900 

ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat
960

tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact 
1020

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac 
1080 

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc 
1140 

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg 
1200

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc 
1260 

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt 
1320 

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc
1380 

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa 
1440 

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca 
1500 

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620 

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa
1680

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg
1740

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc
1800

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag
1860

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac
1920

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc
1980

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat
2040

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct
2100

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc
2160

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg
2220

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc
2280

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg
2340

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc
2400

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag
2460

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa
2520

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt
2580

tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga
2640

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca
2700

actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc
2760

ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag
2820

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg 
2880 



catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag 
2940 



accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg 
3000 



gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc 
3060 



ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt 
3120 

gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt 
3180 

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta 
3240 



aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc 
3300 

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat 
3360 



gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta 
3420 

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag 
3480 

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg 
3540 



gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt
3600 

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc 
3660 

atagaagaag gcaaagggaa actctggtgg aatttgagga aagcatgcta taagatagtg 
3720 

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg 
3780 

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct 
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt
3900



tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca 
3960 



ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc 
4020 



agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt 
4080 



gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg 
4140 



atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200 



tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag 
4260 



tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt 
4320 



gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg 
4380 

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac 
4440 

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg 
4500 

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt 
4560 

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt 
4620 

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt 
4680 

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg 
4740 

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg 
4800 

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt 
4860 

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc 
4920 

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc
4980

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg
5040

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc
5100

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt 
5160 

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc 
5220 

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat 
5280 

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac 
5340 

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg 
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa 
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt 
5700 

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca 
5760 

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag 
5820 

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 

aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc 
5940 

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 
6000 

acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa
6060

tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag
6120 

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 

gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat
6240

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc
6300 

agtaaactgg agaaatagta tcgatggg 
6328 

<210> 6 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 6 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 

cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 

ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 

gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 

gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300 

tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 

atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 

acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt 
480 

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt 
540 

atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt 
600 

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca
660

tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca
720 

gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg 
780 



aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg 
840



aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata 
900 



ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat 
960 



tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact 
1020 

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac 
1080 

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc 
1140 

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg 
1200 

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc 
1260 

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt 
1320 

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc 
1380

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa 
1440 

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca 
1500 

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620 

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa 
1680 

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg 
1740 

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc
1800

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag
1860

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac
1920

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc
1980

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat
2040

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct
2100

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc
2160

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg
2220

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc
2280

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg
2340

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc
2400

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag
2460

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa
2520

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt
2580

tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga
2640

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca
2700

actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc
2760

ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag
2820

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg
2880

catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag
2940

accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg
3000 

gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc 
3060 

ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt 
3120 

gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt 
3180 

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta
3240 

aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc
3300 

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat 
3360 

gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta 
3420 

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag 
3480 

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg 
3540 

gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt 
3600 

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc 
3660 

atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg
3720 

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg 
3780 

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct 
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt 
3900 

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca
3960

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc
4020

agaacactaa gagctctgag gccactgaga gctttgtccc agtttgaagg aatgagggtt
4080

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg
4140

atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat
4200

tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag
4260

tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt
4320

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg
4380

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac
4440

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg
4500

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt
4560

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt
4620

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt
4680

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg
4740

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg
4800

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt
4860

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc
4920

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc
4980

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg
5040

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc
5100

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt 
5160 

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc 
5220 

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat 
5280 

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac 
5340 

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg 
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa 
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt 
5700 

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca 
5760 

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag 
5820 

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 

aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc 
5940 

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 
6000 

acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 

tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120 

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat
6180

gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat
6240 

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300 



agtaaactgg agaaatagta tcgatggg 
6328 



<210> 7 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 



cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 



ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 



gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 



gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300 



tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 



atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 



acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt 
480 

ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt 
540 

atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt 
600 

tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca
660 



tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca 
720 



gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg
780

aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg
840

aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata
900

ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat
960

tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact
1020

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac
1080

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc
1140

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg
1200

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc
1260

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt
1320

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc
1380

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa
1440

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca
1500

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag
1560

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga
1620

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa
1680

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg
1740

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc
1800

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag
1860

gacattggct ctgagaatga ctttgctgac gatgagcaca gcacctttga ggacaatgac
1920

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc
1980

agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat 
2040 

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct 
2100 

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc 
2160 

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg 
2220 

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc 
2280 

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg 
2340 

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc 
2400 

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag
2460 

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa 
2520 

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt 
2580 

tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga 
2640 

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca 
2700 

actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc 
2760 

ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag 
2820 

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg 
2880 

catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag 
2940 

accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg
3000

gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc
3060

ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt
3120

gctgtgggaa ggatgcagaa aggaatcaat tttgttaaaa gaaaaatacg tgaatttatt
3180

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta
3240

aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc
3300

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat
3360

gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta
3420

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag
3480

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg
3540

gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt
3600

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc
3660

atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg
3720

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg
3780

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct
3840

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt
3900

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca
3960

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc
4020

agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt
4080

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg
4140 



atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200 



tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag 
4260 



tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt 
4320 

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg 
4380 

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac 
4440 

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg 
4500 

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt 
4560 

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt 
4620 

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt 
4680 

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg 
4740 

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg 
4800 

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt 
4860 

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc 
4920 

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc 
4980 

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgatcaaagg agcaaagggg 
5040 

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc 
5100 

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt 
5160 
aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc
5220 

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat 
5280 

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac 
5340 

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg 
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa 
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt 
5700 

ttgggtgaga gtggagagat ggatgccctc cgaatacaga tggaagagcg attcatggca 
5760 

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag 
5820 

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 

aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc 
5940 

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 
6000 

acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 

tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120 

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 

gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat 
6240 

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300



agtaaactgg agaaatagta tcgatggg 
6328 



<210> 8 

<211> 6328 

<212> DNA 

<213> Homo sapiens 

<400> 8 

ttcttggtgc cagcttatca atcccaaact ctgggtgtaa aagattctac agggcacttt 
60 



cttatgcaag gagctaaaca gtgattaaag gagcaggatg aaaagatggc acagtcagtg 
120 

ctggtaccgc caggacctga cagcttccgc ttctttacca gggaatccct tgctgctatt 
180 



gaacaacgca ttgcagaaga gaaagctaag agacccaaac aggaacgcaa ggatgaggat 
240 



gatgaaaatg gcccaaagcc aaacagtgac ttggaagcag gaaaatctct tccatttatt 
300 



tatggagaca ttcctccaga gatggtgtca gtgcccctgg aggatctgga cccctactat 
360 



atcaataaga aaacgtttat agtattgaat aaagggaaag caatctctcg attcagtgcc 
420 



acccctgccc tttacatttt aactcccttc aaccctatta gaaaattagc tattaagatt 
480 



ttggtacatt ctttattcaa tatgctcatt atgtgcacga ttcttaccaa ctgtgtattt 
540 



atgaccatga gtaaccctcc agactggaca aagaatgtgg agtatacctt tacaggaatt 
600 



tatacttttg aatcacttat taaaatactt gcaaggggct tttgtttaga agatttcaca 
660 



tttttacggg atccatggaa ttggttggat ttcacagtca ttacttttgc atatgtgaca 
720 



gagtttgtgg acctgggcaa tgtctcagcg ttgagaacat tcagagttct ccgagcattg 
780 



aaaacaattt cagtcattcc aggcctgaag accattgtgg gggccctgat ccagtcagtg 
840 



aagaagcttt ctgatgtcat gatcttgact gtgttctgtc taagcgtgtt tgcgctaata 
900

ggattgcagt tgttcatggg caacctacga aataaatgtt tgcaatggcc tccagataat
960 

tcttcctttg aaataaatat cacttccttc tttaacaatt cattggatgg gaatggtact 
1020 

actttcaata ggacagtgag catatttaac tgggatgaat atattgagga taaaagtcac
1080

ttttattttt tagaggggca aaatgatgct ctgctttgtg gcaacagctc agatgcaggc 
1140

cagtgtcctg aaggatacat ctgtgtgaag gctggtagaa accccaacta tggctacacg 
1200 

agctttgaca cctttagttg ggcctttttg tccttatttc gtctcatgac tcaagacttc 
1260 

tgggaaaacc tttatcaact gacactacgt gctgctggga aaacgtacat gatatttttt 
1320 

gtgctggtca ttttcttggg ctcattctat ctaataaatt tgatcttggc tgtggtggcc 
1380

atggcctatg aggaacagaa tcaggccaca ttggaagagg ctgaacagaa ggaagctgaa
1440

tttcagcaga tgctcgaaca gttgaaaaag caacaagaag aagctcaggc ggcagctgca 
1500

gccgcatctg ctgaatcaag agacttcagt ggtgctggtg ggataggagt tttttcagag 
1560 

agttcttcag tagcatctaa gttgagctcc aaaagtgaaa aagagctgaa aaacagaaga 
1620

aagaaaaaga aacagaaaga acagtctgga gaagaagaga aaaatgacag agtcctaaaa 
1680 

tcggaatctg aagacagcat aagaagaaaa ggtttccgtt tttccttgga aggaagtagg 
1740 

ctgacatatg aaaagagatt ttcttctcca caccagtcct tactgagcat ccgtggctcc 
1800 

cttttctctc caagacgcaa cagtagggcg agccttttca gcttcagagg tcgagcaaag 
1860 

gacattggct ctgagaatga ctttgctgat gatgagcaca gcacctttga ggacaatgac 
1920 

agccgaagag actctctgtt cgtgccgcac agacatggag aacggcgcca cagcaatgtc 
1980 
agccaggcca gccgtgcctc cagggtgctc cccatcctgc ccatgaatgg gaagatgcat
2040

agcgctgtgg actgcaatgg tgtggtctcc ctggtcgggg gcccttctac cctcacatct 
2100 

gctgggcagc tcctaccaga gggcacaact actgaaacag aaataagaaa gagacggtcc 
2160 

agttcttatc atgtttccat ggatttattg gaagatccta catcaaggca aagagcaatg 
2220 

agtatagcca gtattttgac caacaccatg gaagaacttg aagaatccag acagaaatgc 
2280 

ccaccatgct ggtataaatt tgctaatatg tgtttgattt gggactgttg taaaccatgg 
2340 

ttaaaggtga aacaccttgt caacctggtt gtaatggacc catttgttga cctggccatc 
2400 

accatctgca ttgtcttaaa tacactcttc atggctatgg agcactatcc catgacggag
2460 

cagttcagca gtgtactgtc tgttggaaac ctggtcttca cagggatctt cacagcagaa 
2520 

atgtttctca agataattgc catggatcca tattattact ttcaagaagg ctggaatatt 
2580 

tttgatggtt ttattgtgag ccttagttta atggaacttg gtttggcaaa tgtggaagga 
2640 

ttgtcagttc tccgatcatt ccggctgctc cgagttttca agttggcaaa atcttggcca 
2700 

actctaaata tgctaattaa gatcattggc aattctgtgg gggctctagg aaacctcacc 
2760 

ttggtattgg ccatcatcgt cttcattttt gctgtggtcg gcatgcagct ctttggtaag 
2820 

agctacaaag aatgtgtctg caagatttcc aatgattgtg aactcccacg ctggcacatg 
2880 

catgactttt tccactcctt cctgatcgtg ttccgcgtgc tgtgtggaga gtggatagag 
2940 

accatgtggg actgtatgga ggtcgctggc caaaccatgt gccttactgt cttcatgatg 
3000 

gtcatggtga ttggaaatct agtggttctg aacctcttct tggccttgct tttgagttcc 
3060 
ttcagttctg acaatcttgc tgccactgat gatgataacg aaatgaataa tctccagatt
3120 

gctgtgggaa ggatgcagaa aggaatcgat tttgttaaaa gaaaaatacg tgaatttatt 
3180 

cagaaagcct ttgttaggaa gcagaaagct ttagatgaaa ttaaaccgct tgaagatcta 
3240 

aataataaaa aagacagctg tatttccaac cataccacca tagaaatagg caaagacctc 
3300 

aattatctca aagacggaaa tggaactact agtggcatag gcagcagtgt agaaaaatat 
3360 

gtcgtggatg aaagtgatta catgtcattt ataaacaacc ctagcctcac tgtgacagta 
3420 

ccaattgctg ttggagaatc tgactttgaa aatttaaata ctgaagaatt cagcagcgag 
3480 

tcagatatgg aggaaagcaa agagaagcta aatgcaacta gttcatctga aggcagcacg 
3540 

gttgatattg gagctcccgc cgagggagaa cagcctgagg ttgaacctga ggaatccctt 
3600 

gaacctgaag cctgttttac agaagactgt gtacggaagt tcaagtgttg tcagataagc 
3660 

atagaagaag gcaaagggaa actctggtgg aatttgagga aaacatgcta taagatagtg 
3720 

gagcacaatt ggttcgaaac cttcattgtc ttcatgattc tgctgagcag tggggctctg 
3780 

gcctttgaag atatatacat tgagcagcga aaaaccatta agaccatgtt agaatatgct 
3840 

gacaaggttt tcacttacat attcattctg gaaatgctgc taaagtgggt tgcatatggt
3900 

tttcaagtgt attttaccaa tgcctggtgc tggctagact tcctgattgt tgatgtctca 
3960 

ctggttagct taactgcaaa tgccttgggt tactcagaac ttggtgccat caaatccctc 
4020 

agaacactaa gagctctgag gccactgaga gctttgtccc ggtttgaagg aatgagggtt 
4080 

gttgtaaatg ctcttttagg agccattcca tctatcatga atgtacttct ggtttgtctg 
4140 

atcttttggc taatattcag tatcatggga gtgaatctct ttgctggcaa gttttaccat 
4200



tgtattaatt acaccactgg agagatgttt gatgtaagcg tggtcaacaa ctacagtgag
4260

tgcaaagctc tcattgagag caatcaaact gccaggtgga aaaatgtgaa agtaaacttt
4320

gataacgtag gacttggata tctgtctcta cttcaagtag ccacgtttaa gggatggatg
4380

gatattatgt atgcagctgt tgattcacga aatgtagaat tacaacccaa gtatgaagac
4440

aacctgtaca tgtatcttta ttttgtcatc tttattattt ttggttcatt ctttaccttg
4500

aatcttttca ttggtgtcat catagataac ttcaaccaac agaaaaagaa gtttggaggt
4560

caagacattt ttatgacaga agaacagaag aaatactaca atgcaatgaa aaaactgggt
4620

tcaaagaaac cacaaaaacc catacctcga cctgctaaca aattccaagg aatggtcttt
4680

gattttgtaa ccaaacaagt ctttgatatc agcatcatga tcctcatctg ccttaacatg
4740

gtcaccatga tggtggaaac cgatgaccag agtcaagaaa tgacaaacat tctgtactgg
4800

attaatctgg tgtttattgt tctgttcact ggagaatgtg tgctgaaact gatctctctt
4860

cgttactact atttcactat tggatggaat atttttgatt ttgtggtggt cattctctcc
4920

attgtaggaa tgtttctggc tgaactgata gaaaagtatt ttgtgtcccc taccctgttc
4980

cgagtgatcc gtcttgccag gattggccga atcctacgtc tgaacaaagg agcaaagggg
5040

atccgcacgc tgctctttgc tttgatgatg tcccttcctg cgttgtttaa catcggcctc
5100

cttcttttcc tggtcatgtt catctacgcc atctttggga tgtccaattt tgcctatgtt
5160

aagagggaag ttgggatcga tgacatgttc aactttgaga cctttggcaa cagcatgatc
5220

tgcctgttcc aaattacaac ctctgctggc tgggatggat tgctagcacc tattcttaat
5280

agtggacctc cagactgtga ccctgacaaa gatcaccctg gaagctcagt taaaggagac
5340

tgtgggaacc catctgttgg gattttcttt tttgtcagtt acatcatcat atccttcctg 
5400 

gttgtgctga acatgtacat cgcggtcatc ctggagaact tcagtgttgc tactgaagaa
5460 

agtgcagagc ctctgagtga ggatgacttt gagatgttct atgaggtttg ggagaagttt 
5520 

gatcccgatg cgacccagtt tatagagttt gccaaacttt ctgattttgc agatgccctg 
5580 

gatcctcctc ttctcatagc aaaacccaac aaagtccagc tcattgccat ggatctgccc 
5640 

atggtgagtg gtgaccggat ccactgtctt gacatcttat ttgcttttac aaagcgtgtt 
5700 

ttgggtgaga gtggagagat ggatgccctt cgaatacaga tggaagagcg attcatggca 
5760 

tcaaacccct ccaaagtctc ttatgagccc attacgacca cgttgaaacg caaacaagag 
5820 

gaggtgtctg ctattattat ccagagggct tacagacgct acctcttgaa gcaaaaagtt 
5880 

aaaaaggtat caagtatata caagaaagac aaaggcaaag aatgtgatgg aacacccatc 
5940 

aaagaagata ctctcattga taaactgaat gagaattcaa ctccagagaa aaccgatatg 
6000 

acgccttcca ccacgtctcc accctcgtat gatagtgtga ccaaaccaga aaaagaaaaa 
6060 

tttgaaaaag acaaatcaga aaaggaagac aaagggaaag atatcaggga aagtaaaaag 
6120 

taaaaagaaa ccaagaattt tccattttgt gatcaattgt ttacagcccg tgatggtgat 
6180 

gtgtttgtgt caacaggact cccacaggag gtctatgcca aactgactgt ttttacaaat 
6240 

gtatacttaa ggtcagtgcc tataacaaga cagagacctc tggtcagcaa actggaactc 
6300 


agtaaactgg agaaatagta tcgatggg 
6328 
<210> 9

<211> 238 

<212> DNA 

<213> Homo sapiens 

<400> 9

gtggaactcc agcctaagta tgaagaaagt ctgtacatgt atctttactt tgttattttc 
60 

atcatctttg ggtccttctt caccttgaac ctgtttattg gtgtcatcat agataatttc 
120 

aaccagcaga aaaagaagat aagtatttct aatattttct ctcccactga aatagaaaat 
180 



238 




<210> 10

<211> 373

<212> DNA

<213> Homo sapiens

<400> 10

gcctgaagac
60





tcttgactgt gttctgtcta agcgtgtttg cgctaatagg attgcagttg ttcatgggca 
120 

acctacgaaa taaatgtttg caatggcctc cagataattc ttcctttgaa ataaatatca
180 

cttccttctt taacaattca ttggatggga atggtactac tttcaatagg acagtgagca 
240 

tatttaactg ggatgaatat attgaggata aaagtaagat atactctata aaccattaag 
300 

ttgtttagtt ctctaaatat taaatattat ataaaatgga aattatctca atttagatgt 
360 

gaatcaagtg act 
373 



<210> 11 

<211> 274 

<212> DNA 

<213> Homo sapiens 

<400> 11 

taggcacctg ataagagctt gcatcgtttc cttttttaag aaatcgtcaa ttagagactg 
60

tttctgatca taaaatttaa tagaattttt tgacttacag gcctttgaag atatatacat 
120 

tgagcagcga aaaaccatta agaccatgtt agaatatgct gacaaggttt tcacttacat 
180 

attcattctg gaaatgctgc taaagtgggt tgcatatggt tttcaagtgt attttaccaa 
240

tgcctggtgc tggctagact tcctgattgt tgat 
274 



<210> 12

<211> 154

<212> DNA

<213> Homo sapiens

<400> 12

gtattgaata
60





aattttgggg aaaaagaaaa tgatatgact tttcttacag gccacgttta agggatggat 
120

ggatattatg tatgcagctg ttgattcacg aaat 



154 




<210> 13

<211> 219

<212> DNA

<213> Homo sapiens

<400> 13

ttacagggca
60





attctaagtt ttattttatg tgttgtgttt tctttttcag acgtttatag tattgaataa 
120 

agggaaagca atctctcgat tcagtgccac ccctgccctt tacattttaa ctcccttcaa 
180 

ccctattaga aaattagcta ttaagatttt ggtacattc 
219 

<210> 14 

<211> 242 

<212> DNA 

<213> Homo sapiens 

<400> 14

gtgcctgtat aaaacagaca ttggcatata ttaaaacagg aaaaccaatt agcagacttg
60 

ccgttattga cttcctttct ttcctctaac ctaattacag ccagtgtcct gaaggataca 
120 

tctgtgtgaa ggctggtaga aaccccaact atggctacac gagctttgac acctttagtt 
180

gggccttttt gtccttattt cgtctcatga ctcaagactt ctgggaaaac ctttatcaac 
240

tg 

242 



<210> 15 

<211> 388 

<212> DNA 

<213> Homo sapiens 

<400> 15 

gcggcagctg cagccgcatc tgctgaatca agagacttca gtggtgctgg tgggatagga 
60 

gttttttcag agagttcttc agtagcatct aagttgagct ccaaaagtga aaaagagctg 
120 

aaaaacagaa gaaagaaaaa gaaacagaaa gaacagtctg gagaagaaga gaaaaatgac 
180 

agagtcctaa aatcggaatc tgaagacagc ataagaagaa aaggtttccg tttttccttg 
240 


gaaggaagta ggccgacata tgaaaagaga ttttcttctc cacaccaggt aaaaatatta 
300 

aattacatga attgtgttct cataaatttt ttaaaagaat atgccagaat ttaatggaga 
360 

gaaaaccgcc ttccacctgg atggcaca 
388 



<210> 16 

<211> 445 

<212> DNA 

<213> Homo sapiens 

<400> 16 

aagtcaatga ctatgacaca atgaatcaaa ttctgttttt cagaatgcca gctcttaact 
60 

ctcttcatct catttttgtt tcttttcttg ttattcatag tccttactga gcatccgtgg 
120

ctcccttttc tctccaagac gcaacagtag ggcgagcctt ttcagcttca gaggtcgagc 
180 

aaaggacatt ggctctgaga atgactttgc tgatgatgag cacagcacct ttgaggacaa 
240 

tgacagccga agagactctc tgttcgtgcc gcacagacat ggagaacggc gccacagcaa 
300

tgtcagccag gccagccgtg cctccagggt gctccccatc ctgcccatga atgggaagat 
360 


gcatagcgct gtggactgca atggtgtggt ctccctggtc gggggccctt ctaccctcac 
420

atctgctggg cagctcctac cagag 
445 

<210> 17 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 17 

aaatgcatac agaagatggg gggggggcat acctaattaa tttttatatt tagattaaag 
60 

aaaataatta aatgtgtttt tttgtgggat tgattttcag aagctaaatg caactagttc 
120

atctgaaggc agcacggttg atattggagc tcccgccgag ggagaacagc ctgaggttga 
180 

acctgaggaa tcccttgaac ctgaagcctg ttttacagaa g 
221 

<210> 18 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 18 

aaaatgcata cagaagatgg gggggggcac acctaattaa tttttatatt tagattaaag 
60 

aaaataatta aatgtgtttt tttgtgggat tgattttcag aagctaaatg caactagttc 
120 

atctgaaggc agcacggttg atattggagc tcccgccgag ggagaacagc ctgaggttga 
180 
acctgaggaa tcccttgaac ctgaagcctg ttttacagaa g
221 



<210> 19 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 19 

aatgcataca gaagatgggg gggggggcac acctaattaa tttttatatt tagattaaag 
60 

aaaataatta aatgtgtttt tttgtgggat tgattttcag aagctaaatg caactagttc 
120 

atctgaaggc agcacggttg atattggagc tcccgccgag ggagaacagc ctgaggttga 
180 

acctgaggaa tcccttgaac ctgaagcctg ttttacagaa g 
221 



<210> 20 

<211> 1679 

<212> DNA 

<213> Homo sapiens 

<400> 20 

gggagctgtg gcgcggagcg gcccctctgc tgcgtctgcc ctcgttttgt ctcacgactc 
60 

acactcagtg ctccattccc caagagttcg cgttccccgc gcggcggtcg agaggcggct 
120 

gcccgcggtc ccgcgcgggc gcggggcgat ggcggcgcgg gggtcagggc cccgcgcgct 
180 

ccgcctgctg ctcttggtcc agctggtcgc gggggcgctg cggtctagcc gggcgcggcg 
240 

ggcggcgcgc agaggattat ctgaaccttc ttctattgca aaacatgaag atagtttgct 
300 

taaggattta tttcaagact acgaaagatg ggttcgtcct gtggaacacc tgaatgacaa 
360 

aataaaaata aaatttggac ttgcaatatc tcaattggtg gatgtggatg agaaaaatca 
420 

gttaatgaca acaaacgtct ggttgaaaca ggaatggata gatgtaaaat taagatggaa
480 

ccctgatgac tatggtggaa taaaagttat acgtgttcct tcagactctt cgtggacacc 
540


agacatcatt ttgtttgata atgcagatgg acgttttgaa gggaccagta cgaaaacagt 
600 

catcaggtac aatggcactg tcacctggac tccaccggca aactacaaaa gttcctgtac 
660 

catagatgtc acgtttttcc catttgacct tcagaactgt tccatgaaat ttggttcttg 
720 

gacttatgat ggatcacagg ttgatataat tctagaggac caagatgtag acaagagaga 
780 

tttttttgat aatggagaat gggagattgt gagtgcaaca gggagcaaag gaaacagaac 
840 

cgacagctgt tgctggtatc cgtatgtcac ttactcattt gtaatcaagc gcctgcctct 
900 

cttttatacc ttgttcctta taataccctg tattgggctc tcatttttaa ctgtacttgt 
960 

cttctatctt ccttcaaatg aaggtgaaaa gatttgtctc tgcacttcag tacttgtgtc 
1020 

tttgactgtc ttccttctgg ttattgaaga gatcatacca tcatcttcaa aagtcatacc 
1080 

tctaattgga gagtatctgg tatttaccat gatttttgtg acactgtcaa ttatggtaac 
1140 

cgtcttcgct atcaacattc atcatcgttc ttcctcaaca cataatgcca tggcgccttt 
1200 

ggtccgcaag atatttcttc acacgcttcc caaactgctt tcgatgagaa gtcatgtaga 
1260 

caggtacttc actcagaaag aggaaactga gagtggtagt ggaccaaaat cttctagaaa 
1320 

cacattggaa gctgcgctcg attctattcg ctacattaca acacacatca tgaaggaaaa 
1380 

tgatgtccgt gaggttgttg aagattggaa attcatagcc caggttcttg atcggatgtt 
1440 

tctgtggact tttcttttcg tttcaattgt tggatctctt gggctttttg ttcctgttat 
1500 

ttataaatgg gcaaatatat taataccagt tcatattgga aatgcaaata agtgaagcct 
1560 

cccaagggac tgaagtatac atttagttaa cacacatata tctgatggca cctataaaat 
1620 
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t 1679 aaatg taagttatgt gttaaattta gtgcaagctt taacagacta agttgctaa 

<210> 21 

<211> 2664 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gagagaacag cgtgagcctg tgtgcttgtg tgctgagccc tcatcccctc ctggggccag 
60

gcttgggttt cacctgcaga atcgcttgtg ctgggctgcc tgggctgtcc tcagtggcac
120 

ctgcatgaag ccgttctggc tgccagagct ggacagcccc aggaaaaccc acctctctgc 
180 

agagcttgcc cagctgtccc cgggaagcca aatgcctctc atgtaagtct tctgctcgac 
240 

ggggtgtctc ctaaaccctc actcttcagc ctctgtttga ccatgaaatg aagtgactga 
300

gctctattct gtacctgcca ctctatttct ggggtgactt ttgtcagctg cccagaatct 
360 

ccaagccagg ctggttctct gcatcctttc aatgacctgt tttcttctgt aaccacaggt
420

tcggtggtga gaggaagcct cgcagaatcc agcagaatcc tcacagaatc cagcagcagc
480 

tctgctgggg acatggtcca tggtgcaacc cacagcaaag ccctgacctg acctcctgat 
540 

gctcaggaga agccatgggc ccctcctgtc ctgtgttcct gtccttcaca aagctcagcc 
600 

tgtggtggct ccttctgacc ccagcaggtg gagaggaagc taagcgccca cctcccaggg 
660 

ctcctggaga cccactctcc tctcccagtc ccacggcatt gccgcaggga ggctcgcata 
720 

ccgagactga ggaccggctc ttcaaacacc tcttccgggg ctacaaccgc tgggcgcgcc 
780 

cggtgcccaa cacttcagac gtggtgattg tgcgctttgg actgtccatc gctcagctca 
840 

tcgatgtgga tgagaagaac caaatgatga ccaccaacgt ctggctaaaa caggagtgga 
900 

gcgactacaa actgcgctgg aaccccactg attttggcaa catcacatct ctcagggtcc 
960 

cttctgagat gatctggatc cccgacattg ttctctacaa caatgcagat ggggagtttg 
1020



cagtgaccca catgaccaag gcccacctct tctccacggg cactgtgcac tgggtgcccc 
1080 



cggccatcta caagagctcc tgcagcatcg acgtcacctt cttccccttc gaccagcaga 
1140 

actgcaagat gaagtttggc tcctggactt atgacaaggc caagatcgac ctggagcaga 
1200



tggagcagac tgtggacctg aaggactact gggagagcgg cgagtgggcc atcgtcaatg 
1260 



ccacgggcac ctacaacagc aagaagtacg actgctgcgc cgagatctac cccgacgtca 
1320 



cctacgcctt cgtcatccgg cggctgccgc tcttctacac catcaacctc atcatcccct 
1380 



gcctgctcat ctcctgcctc actgtgctgg tcttctacct gccctccgac tgcggcgaga 
1440 



agatcacgct gtgcatttcg gtgctgctgt cactcaccgt cttcctgctg ctcatcactg 
1500 



agatcatccc gtccacctcg ctggtcatcc cgctcatcgg cgagtacctg ctgttcacca 
1560 



tgatcttcgt caccctgtcc atcgtcatca ccgtcttcgt gctcaatgtg caccaccgct 
1620 

cccccagcac ccacaccatg ccccactggg tgcggggggc ccttctgggc tgtgtgcccc 
1680 



ggtggcttct gatgaaccgg cccccaccac ccgtggagct ctgccacccc ctacgcctga 
1740 



agctcagccc ctcttatcac tggctggaga gcaacgtgga tgccgaggag agggaggtgg 
1800 



tggtggagga ggaggacaga tgggcatgtg caggtcatgt ggccccctct gtgggcaccc 
1860 



tctgcagcca cggccacctg cactctgggg cctcaggtcc caaggctgag gctctgctgc 
1920 



aggagggtga gctgctgcta tcaccccaca tgcagaaggc actggaaggt gtgcactaca 
1980 



ttgccgacca cctgcggtct gaggatgctg actcttcggt gaaggaggac tggaagtatg 
2040

ttgccatggt catcgacagg atcttcctct ggctgtttat catcgtctgc ttcctgggga 
2100 

ccatcggcct ctttctgcct ccgttcctag ctggaatgat ctgactgcac ctccctcgag 
2160 

ctggctccca gggcaaaggg gagggttctt ggatgtggaa gggctttgaa caatgtttag 
2220 

atttggagat gagcccaaag tgccagggag aacagccagg tgaggtggga ggttggagag 
2280

ccaggtgagg tctctctaag tcaggctggg gttgaagttt ggagtctgtc cgagtttgca 
2340 

gggtgctgag ctgtatggtc cagcagggga gtaataaggg ctcttccgga aggggaggaa 
2400 

gcgggaggca ggcctgcacc tgatgtggag gtacaggcag atcttcccta ccggggaggg 
2460 

atggatggtt ggatacaggt ggctgggcta ttccatccat ctggaagcac atttgagcct 
2520 

ccaggcttct ccttgacgtc attcctctcc ttccttgctg caaaatggct ctgcaccagc 
2580 

cggcccccag gaggtctggc agagctgaga gccatggcct gcaggggctc catatgtccc 
2640 

tacgcgtgca gcaggcaaac aaga 
2664 

<210> 22 

<211> 3020 

<212> DNA 

<213> Homo sapiens 

<400> 22 

gtcctcccgc gggtccgagg gcgctggaaa cccagcggcg gcgaagcgga gaggagcccc 
60 

gcgcgtctcc gcccgcacgg ctccaggtct ggggtctgcg ctggagccgc gcggggagag 
120 


gccgtctctg cgaccgccgc gcccgctccc gaccgtccgg gtccgcggcc agcccggcca 
180 

ccagccatgg gctctggccc gctctcgctg cccctggcgc tgtcgccgcc gcggctgctg 
240 

ctgctgctgc tgctgtctct gctgccagtg gccagggcct cagaggctga gcaccatcta 
300



tttgagcggc tgtttgaaga ttacaatgag atcatccggc ctgtagccaa cgtgtctgac
360

ccagtcatca tccatttcga ggtgtccatg tctcagctgg tgaaggtgga tgaagtaaac
420

cagatcatgg agaccaacct gtggctcaag caaatctgga atgactacaa gctgaagtgg
480

aacccctctg actatggtgg ggcagagttc atgcgtgtcc ctgcacagaa gatctggaag
540

ccagacattg tgctgtataa caatgctgtt ggggatttcc aggtggacga caagaccaaa
600

gccttactca agtacactgg ggaggtgact tggatacctc cggccatctt taagagctcc
660

tgtaaaatcg acgtgaccta cttcccgttt gattaccaaa actgtaccat gaagttcggt
720

tcctggtcct acgataaggc gaaaatcgat ctggtcctga tcggctcttc catgaacctc
780

aaggactatt gggagagcgg cgagtgggcc atcatcaaag ccccaggcta caaacacgac
840

atcaagtaca actgctgcga ggagatctac cccgacatca catactcgct gtacatccgg
900

cgcctgccct tgttctacac catcaacctc atcatcccct gcctgctcat ctccttcctc
960

actgtgctcg tcttctacct gccctccgac tgcggtgaga aggtgaccct gtgcatttct
1020

gtcctcctct ccctgacggt gtttctcctg gtgatcactg agaccatccc ttccacctcg
1080

ctggtcatcc ccctgattgg agagtacctc ctgttcacca tgatttttgt aaccttgtcc
1140

atcgtcatca ccgtcttcgt gctcaacgtg cactacagaa ccccgacgac acacacaatg
1200

ccctcatggg tgaagactgt attcttgaac ctgctcccca gggtcatgtt catgaccagg
1260

ccaacaagca acgagggcaa cgctcagaag ccgaggcccc tctacggtgc cgagctctca
1320

aatctgaatt gcttcagccg cgcagagtcc aaaggctgca aggagggcta cccctgccag
1380

gacgggatgt gtggttactg ccaccaccgc aggataaaaa tctccaattt 
1440 



ctcacgagaa gctctagttc tgaatctgtt gatgctgtgc tgtccctctc tgctttgtca 
1500 



ccagaaatca aagaagccat ccaaagtgtc aagtatattg ctgaaaatat gaaagcacaa 
1560 



aatgaagcca aagagattca agatgattgg aagtatgttg ccatggtgat tgatcgtatt 
1620 



tttctgtggg ttttcaccct ggtgtgcatt ctagggacag caggattgtt tctgcaaccc 
1680 



ctgatggcca gggaagatgc ataagcacta agctgtgtgc ctgcctggga gacttccttg 
1740 



tgtcagggca ggaggaggct gcttcctagt aagaacgtac tttctgttat caagctacca 
1800 



gctttgtttt tggcatttcg aggtttactt attttccact tatcttggaa tcatgcaaaa 
1860 



aaaaaaatgt caagagtatt tattaccgat aaatgaacat ttaactagcc tttttggtat 
1920 



ggtaaagaga tgtcaaaatg tgattctatg tgattagtat gctatgctat ggaatataca 
1980 



tgtaaaaatg tttcctttta gttgttgaaa caaaactgga tagaaaaatg ctgttcagaa 
2040 



atatgaaaag tcattcagtt atcactacag atctcccagt aatttttctt atttagccca 
2100 



taatctcttt gaaggtttat actaattcag caatccccca tcgttaccca tttcttacca 
2160 

tgcatttctc gttctttact gggtctaaag ggctatgcct ccatttcaga gagcttcaac 
2220 

tacttctctt gcatacttct aaattatact atgagaaatc atgcctagtt attcattgtt 
2280 

aatataactg tcttagtaca ccataaactg ggtggattat aaacaacaga aacttctcag 
2340 



ttttggaggt tgggaggtcc aaggtcaagg caccagcaaa tttggtgtct ggtgagggtc 
2400 

ctcttcctca aagggtgcct tctagctgtg tcctcacatg actgaaggga ctagctatct 
2460 
ctgtggggtc tattttataa gggcactaac cccattcatg agagcagagc ccccatggcc
2520 

taatcacctt tccaaggccc caccttctat ctaagacaat cacgctggga ataggtttca 
2580 



acatatgaat tgggggagga cacatttgga ccacagcatg aacctttaga acagggtttc 
2640



tcagccttag cactacggac attttgggct ggataaatat gtgttggtac agaatggggg 
2700



tatcctgtgc attgtaggat ctttagcagt accctagcct caactcacta gatgccaatg 
2760 



acataccttg cttcttcacc agttatgata accaagaatg tctccattgt taaatgtccc 
2820 

cttaggagca aaattgcccc tggttgagaa acattgcttt agacaaattg ttaagagtat 
2880 



catgtactac acttctgaaa cttaacgtga tcatcaccac tgacagatga ttcacagaga 
2940 

gagactgttt gaatcttgtc tcactagttt ttcctgtgca aaaataaaat ggacagaatt 
3000 



gcaaaaaaaa aaaaaaaaaa 
3020 



<210> 23 

<211> 2664 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gagagaacag cgtgagcctg tgtgcttgtg tgctgagccc tcatcccctc ctggggccag 
60 

gcttgggttt cacctgcaga atcgcttgtg ctgggctgcc tgggctgtcc tcagtggcac 
120 

ctgcatgaag ccgttctggc tgccagagct ggacagcccc aggaaaaccc acctctctgc 
180 

agagcttgcc cagctgtccc cgggaagcca aatgcctctc atgtaagtct tctgctcgac 
240 

ggggtgtctc ctaaaccctc actcttcagc ctctgtttga ccatgaaatg aagtgactga 
300 

gctctattct gtacctgcca ctctatttct ggggtgactt ttgtcagctg cccagaatct 
360 
ccaagccagg ctggttctct gcatcctttc aatgacctgt tttcttctgt aaccacaggt
420

tcggtggtga gaggaagcct cgcagaatcc agcagaatcc tcacagaatc cagcagcagc 
480 

tctgctgggg acatggtcca tggtgcaacc cacagcaaag ccctgacctg acctcctgat 
540 

gctcaggaga agccatgggc ccctcctgtc ctgtgttcct gtccttcaca aagctcagcc 
600 

tgtggtggct ccttctgacc ccagcaggtg gagaggaagc taagcgccca cctcccaggg 
660 

ctcctggaga cccactctcc tctcccagtc ccacggcatt gccgcaggga ggctcgcata 
720 

ccgagactga ggaccggctc ttcaaacacc tcttccgggg ctacaaccgc tgggcgcgcc 
780 

cggtgcccaa cacttcagac gtggtgattg tgcgctttgg actgtccatc gctcagctca 
840 

tcgatgtgga tgagaagaac caaatgatga ccaccaacgt ctggctaaaa caggagtgga 
900 

gcgattacaa actgcgctgg aaccccgctg attttggcaa catcacatct ctcagggtcc 
960 

cttctgagat gatctggatc cccgacattg ttctctacaa caatgcagat ggggagtttg 
1020 

cagtgaccca catgaccaag gcccacctct tctccacggg cactgtgcac tgggtgcccc 
1080 

cggccatcta caagagctcc tgcagcatcg acgtcacctt cttccccttc gaccagcaga 
1140 

actgcaagat gaagtttggc tcctggactt atgacaaggc caagatcgac ctggagcaga 
1200 

tggagcagac tgtggacctg aaggactact gggagagcgg cgagtgggcc atcgtcaatg 
1260 


ccacgggcac ctacaacagc aagaagtacg actgctgcgc cgagatctac cccgacgtca 
1320 

cctacgcctt cgtcatccgg cggctgccgc tcttctacac catcaacctc atcatcccct 
1380 

gcctgctcat ctcctgcctc actgtgctgg tcttctacct gccctccgac tgcggcgaga 
1440 

agatcacgct gtgcatttcg gtgctgctgt cactcaccgt cttcctgctg ctcatcactg 
1500

agatcatccc gtccacctcg ctggtcatcc cgctcatcgg cgagtacctg ctgttcacca 
1560

tgatcttcgt caccctgtcc atcgtcatca ccgtcttcgt gctcaatgtg caccaccgct 
1620

cccccagcac ccacaccatg ccccactggg tgcggggggc ccttctgggc tgtgtgcccc 
1680 

ggtggcttct gatgaaccgg cccccaccac ccgtggagct ctgccacccc ctacgcctga 
1740 

agctcagccc ctcttatcac tggctggaga gcaacgtgga tgccgaggag agggaggtgg 
1800 

tggtggagga ggaggacaga tgggcatgtg caggtcatgt ggccccctct gtgggcaccc 
1860 

tctgcagcca cggccacctg cactctgggg cctcaggtcc caaggctgag gctctgctgc 
1920

aggagggtga gctgctgcta tcaccccaca tgcagaaggc actggaaggt gtgcactaca 
1980 

ttgccgacca cctgcggtct gaggatgctg actcttcggt gaaggaggac tggaagtatg 
2040 

ttgccatggt catcgacagg atcttcctct ggctgtttat catcgtctgc ttcctgggga 
2100 

ccatcggcct ctttctgcct ccgttcctag ctggaatgat ctgactgcac ctccctcgag 
2160 

ctggctccca gggcaaaggg gagggttctt ggatgtggaa gggctttgaa caatgtttag 
2220 

atttggagat gagcccaaag tgccagggag aacagccagg tgaggtggga ggttggagag 
2280 

ccaggtgagg tctctctaag tcaggctggg gttgaagttt ggagtctgtc cgagtttgca
2340 

gggtgctgag ctgtatggtc cagcagggga gtaataaggg ctcttccgga aggggaggaa 
2400 

gcgggaggca ggcctgcacc tgatgtggag gtacaggcag atcttcccta ccggggaggg 
2460 

atggatggtt ggatacaggt ggctgggcta ttccatccat ctggaagcac atttgagcct 
2520 

ccaggcttct ccttgacgtc attcctctcc ttccttgctg caaaatggct ctgcaccagc 
2580 
cggcccccag gaggtctggc agagctgaga gccatggcct gcaggggctc catatgtccc
2640 

tacgcgtgca gcaggcaaac aaga 
2664 



<210> 24 

<211> 2664 

<212> DNA 

<213> Homo sapiens 

<400> 24 

gagagaacag cgtgagcctg tgtgcttgtg tgctgagccc tcatcccctc ctggggccag 
60

gcttgggttt cacctgcaga atcgcttgtg ctgggctgcc tgggctgtcc tcagtggcac
120


ctgcatgaag ccgttctggc tgccagagct ggacagcccc aggaaaaccc acctctctgc 
180 

agagcttgcc cagctgtccc cgggaagcca aatgcctctc atgtaagtct tctgctcgac 
240

ggggtgtctc ctaaaccctc actcttcagc ctctgtttga ccatgaaatg aagtgactga 
300

gctctattct gtacctgcca ctctatttct ggggtgactt ttgtcagctg cccagaatct 
360 

ccaagccagg ctggttctct gcatcctttc aatgacctgt tttcttctgt aaccacaggt 
420

tcggtggtga gaggaagcct cgcagaatcc agcagaatcc tcacagaatc cagcagcagc 
480 

tctgctgggg acatggtcca tggtgcaacc cacagcaaag ccctgacctg acctcctgat 
540 

gctcaggaga agccatgggc ccctcctgtc ctgtgttcct gtccttcaca aagctcagcc 
600 

tgtggtggct ccttctgacc ccagcaggtg gagaggaagc taagcgccca cctcccaggg 
660 

ctcctggaga cccactctcc tctcccagtc ccacggcatt gccgcaggga ggctcgcata 
720 

ccgagactga ggaccggctc ttcaaacacc tcttccgggg ctacaaccgc tgggcgcgcc 
780

cggtgcccaa cacttcagac gtggtgattg tgcgctttgg actgtccatc gctcagctca 
840
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ccagccatgg gctctggccc gctctcgctg cccctggcgc tgtcgccgcc gcggctgctg 

ctgctgctgc tgctgtctct gctgccagtg gccagggcct cagaggctga gcaccgtcta 
300

tttgagcggc tgtttgaaga ttacaatgag atcatccggc ctgtggccaa cgtgtctgac 
360

ccagtcatca tccatttcga ggtgtccatg tctcagctgg tgaaggtgga tgaagtaaac 
420 

cagatcatgg agaccaacct gtggctcaag caaatctgga atgactacaa gctgaagtgg 
480

aacccctctg actatggtgg ggcagagttc atgcgtgtcc ctgcacagaa gatctggaag 
540 

ccagacattg tgctgtataa caatgctgtt ggggatttcc aggtggacga caagaccaaa 
600 

gccttactca agtacactgg ggaggtgact tggatacctc cggccatctt taagagctcc 
660

tgtaaaatcg acgtgaccta cttcccgttt gattaccaaa actgtaccat gaagttcggt 
720 

tcctggtcct acgataaggc gaaaatcgat ctggtcctga tcggctcttc catgaacctc 
780 

aaggactatt gggagagcgg cgagtgggcc atcatcaaag ccccaggcta caaacacgac 
840 

atcaagtaca actgctgcga ggagatctac cccgacatca catactcgct gtacatccgg
900 

cgcctgccct tgttctacac catcaacctc atcatcccct gcctgctcat ctccttcctc 
960 

actgtgctcg tcttctacct gccctccgac tgcggtgaga aggtgaccct gtgcatttct 
1020 

gtcctcctct ccctgacggt gtttctcctg gtgatcactg agaccatccc ttccacctcg 
1080 

ctggtcatcc ccctgattgg agagtacctc ctgttcacca tgatttttgt aaccttgtcc 
1140 

atcgtcatca ccgtcttcgt gctcaacgtg cactacagaa ccccgacgac acacacaatg 
1200 

ccctcatggg tgaagactgt attcttgaac ctgctcccca gggtcatgtt catgaccagg 
1260 

ccaacaagca acgagggcaa cgctcagaag ccgaggcccc tctacggtgc cgagctctca 
1320



aatctgaatt gcttcagccg cgcagagtcc 
1380 



gacgggatgt gtggttactg ccaccaccgc 
1440 

ctcacgagaa gctctagttc tgaatctgtt 
1500 



ccagaaatca aagaagccat ccaaagtgtc 
1560 

aatgaagcca aagagattca agatgattgg 
1620 

tttctgtggg ttttcaccct ggtgtgcatt 
1680 



ctgatggcca gggaagatgc ataagcacta 
1740 



tgtcagggca ggaggaggct gcttcctagt 
1800 



gctttgtttt tggcatttcg aggtttactt 
1860 



aaaaaaatgt caagagtatt tattaccgat 
1920 



ggtaaagaga tgtcaaaatg tgattctatg 
1980 



tgtaaaaatg tttcctttta gttgttgaaa 
2040 



atatgaaaag teat tcagtt atcactacag 
2100 

taatctcttt gaaggtttat actaattcag 
2160 



tgcatttctc gttctttact- gggtctaaag 
2220 

tacttctctt gcatacttct aaattatact 
2280 

aatataactg tcttagtaca ccataaactg 
2340 

ttttggaggt tgggaggtcc aaggtcaagg 
2400 



ile7August03 . ST25 . txt 
aaaggctgca aggagggcta cccctgccag 

■ 

aggataaaaa tctccaattt cagtgctaac 

gatgctgtgc tgtccctctc tgctttgtca 

aagtatattg ctgaaaa'tat gaaagcacaa 

aagtatgttg ccatggtgat tgatcgtatt 

ctagggacag caggattgtt tctgcaaccc 

agctgtgtgc ctgcctggga gacttccttg 

aagaacgtac tttctgttat caagctacca 

attttccact tatcttggaa tcatgcaaaa 

aaatgaacat ttaactagcc tttfctggtat 

tgattagtat gctatgctat ggaatataca 

caaaactgga tagaaaaatg ctgttcagaa 

atctcccagt aatttttctt atttagccca 

caatccccca tcgttaccca tttcttacca 

ggctatgcct ccatttcaga gagcttcaac 

atgagaaatc atgcctagtt attcattgtt 

ggtggattat aaacaacaga aacttctcag 

caccagcaaa tttggtgtct ggtgagggtc 
jb 62 




ctcttcctca aagggtgcct tctagctgtg tcctcacatg actgaaggga ctagctatct 
2460 



ctgtggggtc tattttataa gggcactaac cccattcatg agagcagagc ccccatggcc 
2520 



taatcacctt tccaaggccc caccttctat ctaagacaat cacgctggga ataggtttca 
2580 



acatatgaat tgggggagga cacatttgga ccacagcatg aacctttaga acagggtttc 
2640 



tcagccttag cactacggac attttgggct ggataaatat gtgttggtac agaatggggg 
2700 



tatcctgtgc attgtaggat ctttagcagt accctagcct caactcacta gatgccaatg 
2760 



acataccttg cttcttcacc agttatgata accaagaatg tctccattgt taaatgtccc 
2820 



cttaggagca aaattgcccc tggttgagaa acattgcttt agacaaattg ttaagagtat 
2880 

catgtactac acttctgaaa cttaacgtga tcatcaccac tgacagatga ttcacagaga 
2940 

gagactgttt gaatcttgtc tcactagttt ttcctgtgca aaaataaaat ggacagaatt 
3000 



gcaaaaaaaa aaaaaaaaaa 
3020 



<210> 26 

<211> 3020 

<212> DNA 

<213> Homo sapiens 

<400> 26 

gtcctcccgc gggtccgagg gcgctggaaa cccagcggcg gcgaagcgga gaggagcccc 
60 

gcgcgtctcc gcccgcacgg ctccaggtct ggggtctgcg ctggagccgc gcggggagag 
120 

gccgtctctg cgaccgccgc gcccgctccc gaccgtccgg gtccgcggcc agcccggcca 
180 

ccagccatgg gctctggccc gctctcgctg cccctggcgc tgtcgccgcc gcggctgctg 
240 

ctgctgctgc tgctgtctct gctgccagtg gccagggcct cagaggctga gcaccgtcta 
300 

tttgagcggc tgtttgaaga ttacaatgag atcatccggc ctgtagccaa cgtgtctgac 
360 

ccagtcatca tccatttcga ggtgtccatg tctcagctgg tgaaggtgga tgaagtaaac 
420 

cagatcatgg agaccaacct gtggctcaag caaatctgga atgactacaa gctgaaatgg 
480 

aacccctctg actatggtgg ggcagagttc atgcgtgtcc ctgcacagaa gatctggaag 
540 

ccagacattg tgctgtataa caatgctgtt ggggatttcc aggtggacga caagaccaaa 
600 

gccttactca agtacactgg ggaggtgact tggatacctc cggccatctt taagagctcc 
660 

tgtaaaatcg acgtgaccta cttcccgttt gattaccaaa actgtaccat gaagttcggt 
720 

tcctggtcct acgataaggc gaaaatcgat ctggtcctga tcggctcttc catgaacctc 
780 

aaggactatt gggagagcgg cgagtgggcc atcatcaaag ccccaggcta caaacacgac 
840 

atcaagtaca actgctgcga ggagatctac cccgacatca catactcgct gtacatccgg 
900 

cgcctgccct tgttctacac catcaacctc atcatcccct gcctgctcat ctccttcctc 
960 

actgtgctcg tcttctacct gccctccgac tgcggtgaga aggtgaccct gtgcatttct 
1020 

gtcctcctct ccctgacggt gtttctcctg gtgatcactg agaccatccc ttccacctcg 
1080 

ctggtcatcc ccctgattgg agagtacctc ctgttcacca tgatttttgt aaccttgtcc 
1140 

atcgtcatca ccgtcttcgt gctcaacgtg cactacagaa ccccgacgac acacacaatg 
1200 

ccctcatggg tgaagactgt attcttgaac ctgctcccca gggtcatgtt catgaccagg 
1260 

ccaacaagca acgagggcaa cgctcagaag ccgaggcccc tctacggtgc cgagctctca 
1320 

aatctgaatt gcttcagccg cgcagagtcc aaaggctgca aggagggcta cccctgccag 
1380 




gacgggatgt gtggttactg ccaccaccgc aggataaaaa tctccaattt cagtgctaac 
1440 

ctcacgagaa gctctagttc tgaatctgtt gatgctgtgc tgtccctctc tgctttgtca 
1500 

ccagaaatca aagaagccat ccaaagtgtc aagtatattg ctgaaaatat gaaagcacaa 
1560 

aatgaagcca aagagattca agatgattgg aagtatgttg ccatggtgat tgatcgtatt 
1620 

tttctgtggg ttttcaccct ggtgtgcatt ctagggacag caggattgtt tctgcaaccc 
1680 

ctgatggcca gggaagatgc ataagcacta agctgtgtgc ctgcctggga gacttccttg 
1740 

tgtcagggca ggaggaggct gcttcctagt aagaacgtac tttctgttat caagctacca 
1800 

gctttgtttt tggcatttcg aggtttactt attttccact tatcttggaa tcatgcaaaa 
1860 

aaaaaaatgt caagagtatt tattaccgat aaatgaacat ttaactagcc tttttggtat 
1920 

ggtaaagaga tgtcaaaatg tgattctatg tgattagtat gctatgctat ggaatataca 
1980 

tgtaaaaatg tttcctttta gttgttgaaa caaaactgga tagaaaaatg ctgttcagaa 
2040 

atatgaaaag tcattcagtt atcactacag atctcccagt aatttttctt atttagccca 
2100 

taatctcttt gaaggtttat actaattcag caatccccca tcgttaccca tttcttacca 
2160 

tgcatttctc gttctttact gggtctaaag ggctatgcct ccatttcaga gagcttcaac 
2220 

tacttctctt gcatacttct aaattatact atgagaaatc atgcctagtt attcattgtt 
2280 

aatataactg tcttagtaca ccataaactg ggtggattat aaacaacaga aacttctcag 
2340 

ttttggaggt tgggaggtcc aaggtcaagg caccagcaaa tttggtgtct ggtgagggtc 
2400 

ctcttcctca aagggtgcct tctagctgtg tcctcacatg actgaaggga ctagctatct 
2460 

ctgtggggtc tattttataa gggcactaac cccattcatg agagcagagc ccccatggcc 
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2520 

taatcacctt tccaaggccc caccttctat ctaagacaat cacgctggga ataggtttca 
2580 

acatatgaat tgggggagga cacatttgga ccacagcatg aacctttaga acagggtttc 
2640 

tcagccttag cactacggac attttgggct ggataaatat gtgttggtac agaatggggg 
2700 

tatcctgtgc attgtaggat ctttagcagt accctagcct caactcacta gatgccaatg 
2760 

acataccttg cttcttcacc agttatgata accaagaatg tctccattgt taaatgtccc 
2820 

cttaggagca aaattgcccc tggttgagaa acattgcttt agacaaattg ttaagagtat 
2880 

catgtactac acttctgaaa cttaacgtga tcatcaccac tgacagatga ttcacagaga 
2940 

gagactgttt gaatcttgtc tcactagttt ttcctgtgca aaaataaaat ggacagaatt 
3000 

gcaaaaaaaa aaaaaaaaaa 
3020 

<210> 27 

<211> 3020 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gtcctcccgc gggtccgagg gcgctggaaa cccagcggcg gcgaagcgga gaggagcccc 
60 

gcgcgtctcc gcccgcacgg ctccaggtct ggggtctgcg ctggagccgc gcggggagag 
120 

gccgtctctg cgaccgccgc gcccgctccc gaccgtccgg gtccgcggcc agcccggcca 
180 

ccagccatgg gctctggccc gctctcgctg cccctggcgc tgtcgccgcc gcggctgctg 
240 

ctgctgctgc tgctgtctct gctgccagtg gccagggcct cagaggctga gcaccgtcta 
300 

tttgagcggc tgtttgaaga ttacaatgag atcatccggc ctgtagccaa cgtgtctgac 
360 

ccagtcatca tccatttcga ggtgtccatg tctcagctgg tgaaggtgga tgaagtaaac 
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420 

cagatcatgg agaccaacct gtggctcaag caaatctgga atgactacaa gctgaagtgg 
480 

aacccctctg actatggtgg ggcagagttc atgcgtgtcc ctgcacagaa aatctggaag 
540 

ccagacattg tgctgtataa caatgctgtt ggggatttcc aggtggacga caagaccaaa 
600 

gccttactca agtacactgg ggaggtgact tggatacctc cggccatctt taagagctcc 
660 

tgtaaaatcg acgtgaccta cttcccgttt gattaccaaa actgtaccat gaagttcggt 
720 

tcctggtcct acgataaggc gaaaatcgat ctggtcctga tcggctcttc catgaacctc 
780 

aaggactatt gggagagcgg cgagtgggcc atcatcaaag ccccaggcta caaacacgac 
840 

atcaagtaca actgctgcga ggagatctac cccgacatca catactcgct gtacatccgg 
900 

cgcctgccct tgttctacac catcaacctc atcatcccct gcctgctcat ctccttcctc 
960 

actgtgctcg tcttctacct gccctccgac tgcggtgaga aggtgaccct gtgcatttct 
1020 

gtcctcctct ccctgacggt gtttctcctg gtgatcactg agaccatccc ttccacctcg 
1080 

ctggtcatcc ccctgattgg agagtacctc ctgttcacca tgatttttgt aaccttgtcc 
1140 

atcgtcatca ccgtcttcgt gctcaacgtg cactacagaa ccccgacgac acacacaatg 
1200 

ccctcatggg tgaagactgt attcttgaac ctgctcccca gggtcatgtt catgaccagg 
1260 

ccaacaagca acgagggcaa cgctcagaag ccgaggcccc tctacggtgc cgagctctca 
1320 

aatctgaatt gcttcagccg cgcagagtcc aaaggctgca aggagggcta cccctgccag 
1380 

gacgggatgt gtggttactg ccaccaccgc aggataaaaa tctccaattt cagtgctaac 
1440 

ctcacgagaa gctctagttc tgaatctgtt gatgctgtgc tgtccctctc tgctttgtca 
1500 
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ccagaaatca aagaagccat ccaaagtgtc aagtatattg ctgaaaatat gaaagcacaa 
1560 

aatgaagcca aagagattca agatgattgg aagtatgttg ccatggtgat tgatcgtatt 
1620 

tttctgtggg ttttcaccct ggtgtgcatt ctagggacag caggattgtt tctgcaaccc 
1680 

ctgatggcca gggaagatgc ataagcacta agctgtgtgc ctgcctggga gacttcctta 
1740 

tgtcagggca ggaggaggct gcttcctagt aagaacgtac tttctgttat caagctacca 
1800 

gctttgtttt tggcatttcg aggtttactt attttccact tatcttggaa tcatgcaaaa 
1860 

aaaaaaatgt caagagtatt tattaccgat aaatgaacat ttaactagcc tttttgatat 
1920 

ggtaaagaga tgtcaaaatg tgattctatg tgattagtat gctatgctat ggaatataca 
1980 

tgtaaaaatg tttcctttta gttgttgaaa caaaactgga tagaaaaatg ctgttcagaa 
2040 

atatgaaaag tcattcagtt atcactacag atctcccagt aatttttctt atttagccca 
2100 

taatctcttt gaaggtttat actaattcag caatccccca tcgttaccca tttcttacca 
2160 

tgcatttctc gttctttact gggtctaaag ggctatgcct ccatttcaga gagcttcaac 
2220 

tacttctctt gcatacttct aaattatact atgagaaatc atgcctagtt attcattgtt 
2280 

aatataactg tcttagtaca ccataaactg ggtggattat aaacaacaga aacttctcag 
2340 

ttttggaggt tgggaggtcc aaggtcaagg caccagcaaa tttggtgtct ggtgagggtc 
2400 

ctcttcctca aagggtgcct tctagctgtg tcctcacatg actgaaggga ctagctatct 
2460 

ctgtggggtc tattttataa gggcactaac cccattcatg agagcagagc ccccatggcc 
2520 

taatcacctt tccaaggccc caccttctat ctaagacaat cacgctggga ataggtttca 
2580 

acatatgaat tgggggagga cacatttgga ccacagcatg aacctttaga acagggtttc 
2640 

tcagccttag cactacggac attttgggct ggataaatat gtgttggtac agaatggggg 
2700 

tatcctgtgc attgtaggat ctttagcagt accctagcct caactcacta gatgccaatg 
2760 

acataccttg cttcttcacc agttatgata accaagaatg tctccattgt taaatgtccc 
2820 

cttaggagca aaattgcccc tggttgagaa acattgcttt agacaaattg ttaagagtat 
2880 

catgtactac acttctgaaa cttaacgtga tcatcaccac tgacagatga ttcacagaga 
2940 

gagactgttt gaatcttgtc tcactagttt ttcctgtgca aaaataaaat ggacagaatt 
3000 

gcaaaaaaaa aaaaaaaaaa 
3020 

<210> 28 

<211> 210 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gggctgcttg gcccaattct gggcatcccc ggggtgtgct agctttgccc taggctgctc 
60 

cctggaagcg aggttgacac aacttcttcc ccacacacag gagtggagcg actacaaact 
120 

gcgctggaac cccgctgatt ttggcaacat cacatctctc agggtccctt ctgagatgat 
180 

ctggatcccc gacattgttc tctacaacaa 
210 

<210> 29 

<211> 210 

<212> DNA 

<213> Homo sapiens 

<400> 29 

agcaggggtg gggagtcacc aagatgggtg gtgccacggg aagtaaaacc aggctgattc 
60 

ttttaccgtc tccttctccc tccctgcttc cttccccgag atctggaatg actacaagct 
120 
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gaagtggaac ccctctgact atggtggggc agagttcatg cgtgtccctg cacagaagat 
180 

ctggaagcca gacattgtgc tgtataacaa 
210 



<210> 30 

<211> 210 

<212> DNA 

<213> Homo sapiens 

<400> 30 

atctggaatg actacaagct gaagtggaac ccctctgact atggtggggc agagttcatg 
60 

cgtgtccctg cacagaagat ctggaagcca gacattgtgc tgtataacaa gtaaggtcct 
120 

ggggggccca cgccctctca gggctgtcag cctgggctct gggtttttgg cccactgtgc 
180 

ttaaaacctg gccttccttg gccttttcca 
210 

<210> 31 

<211> 7438 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccatggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180 

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

cccccccaag cgcaacgcct tctaccgcaa gctgcagaat ttcctctaca acgtgctgga 
300 

gcggccgcgc ggctgggcgt tcatctacca cgcctacgtg ttcctcctgg ttttctcctg 
360 

cctcgtgctg tctgtgtttt ccaccatcaa ggagtatgag aagagctcgg agggggccct 
420 

ctacatcctg gaaatcgtga ctatcgtggt gtttggcgtg gagtacttcg tgcggatctg 
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480 

ggccgcaggc tgctgctgcc ggtaccgtgg ctggaggggg cggctcaagt ttgcccggaa 
540 

accgttctgt gtgattgaca tcatggtgct catcgcctcc attgcggtgc tggccgccgg 
600 

ctcccagggc aacgtctttg ccacatctgc gctccggagc ctgcgcttcc tgcagattct 
660 

gcggatgatc cgcatggacc ggcggggagg cacctggaag ctgctgggct ctgtggtcta 
720 

tgcccacagc aaggagctgg tcactgcctg gtacatcggc ttcctttgtc tcatcctggc 
780 

ctcgttcctg gtgtacttgg cagagaaggg ggagaacgac cactttgaca cctacgcgga 
840 

tgcactctgg tggggcctga tcacgctgac caccattggc tacggggaca agtaccccca 
900 

gacctggaac ggcaggctcc ttgcggcaac cttcaccctc atcggtgtct ccttcttcgc 
960 

gctgcctgca ggcatcttgg ggtctgggtt tgccctgaag gttcaggagc agcacaggca 
1020 

gaagcacttt gagaagaggc ggaacccggc agcaggcctg atccagtcgg cctggagatt 
1080 

ctacgccacc aacctctcgc gcacagacct gcactccacg tggcagtact acgagcgaac 
1140 

ggtcaccgtg cccatgtaca gttcgcaaac tcaaacctac ggggcctcca gacttatccc 
1200 

cccgctgaac cagctggagc tgctgaggaa cctcaagagt aaatctggac tcgctttcag 
1260 

gaaggacccc ccgccggagc cgtctccaag ccagaaggtc agtttgaaag atcgtgtctt 
1320 

ctccagcccc cgaggcgtgg ctgccaaggg gaaggggtcc ccgcaggccc agactgtgag 
1380 

gcggtcaccc agcgccgacc agagcctcga ggacagcccc agcaaggtgc ccaagagctg 
1440 

gagcttcggg gaccgcagcc gggcacgcca ggctttccgc atcaagggtg ccgcgtcacg 
1500 

gcagaactca gaagaagcaa gcctccccgg agaggacatt gtggatgaca agagctgccc 
1560 
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ctgcgagttt gtgaccgagg acctgacccc gggcctcaaa gtcagcatca gagccgtgtg 
1620 

tgtcatgcgg ttcctggtgt ccaagcggaa gttcaaggag agcctgcggc cctacgacgt 
1680 

gatggacgtc atcgagcagt actcagccgg ccacctggac atgctgtccc gaattaagag 
1740 

cctgcagtcc agagtggacc agatcgtggg gcggggccca gcgatcacgg acaaggaccg 
1800 

caccaagggc ccggccgagg cggagctgcc cgaggacccc agcatgatgg gacggctcgg 
1860 

gaaggtggag aagcaggtct tgtccatgga gaagaagctg gacttcctgg tgaatatcta 
1920 

catgcagcgg atgggcatcc ccccgacaga gaccgaggcc tactttgggg ccaaagagcc 
1980 

ggagccggcg ccgccgtacc acagcccgga agacagccgg gagcatgtcg acaggcacgg 
2040 

ctgcattgtc aagatcgtgc gctccagcag ctccacgggc cagaagaact tctcggcgcc 
2100 

cccggccgcg ccccctgtcc agtgtccgcc ctccacctcc tggcagccac agagccaccc 
2160 

gcgccagggc cacggcacct cccccgtggg ggaccacggc tccctggtgc gcatcccgcc 
2220 

gccgcctgcc cacgagcggt cgctgtccgc ctacggcggg ggcaaccgcg ccagcatgga 
2280 

gttcctgcgg caggaggaca ccccgggctg caggcccccc gaggggaccc tgcgggacag 
2340 

cgacacgtcc atctccatcc cgtccgtgga ccacgaggag ctggagcgtt ccttcagcgg 
2400 

cttcagcatc tcccagtcca aggagaacct ggatgctctc aacagctgct acgcggccgt 
2460 

ggcgccttgt gccaaagtca ggccctacat tgcggaggga gagtcagaca ccgactccga 
2520 

cctctgtacc ccgtgcgggc ccccgccacg ctcggccacc ggcgagggtc cctttggtga 
2580 

cgtgggctgg gccgggccca ggaagtgagg cggcgctggg ccagtggacc cgcccgcggc 
2640 
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cctcctcagc acggtgcctc cgaggttttg aggcgggaac cctctggggc ccttttctta 
2700 

cagtaactga gtgtggcggg aagggtgggc cctggagggg cccatgtggg ctgaaggatg 
2760 



ggggctcctg gcagtgacct tttacaaaag ttattttcca acagggcact cccaggccct 
2820 



gtcgccattg aggtgcctcc gctgggctgt ctcctcaccc ctccctgtgc tggagcctgt 
2880 



cccaaaaagg tgccaactgg gaggcctcgg aagccactgt ccaggctccc actgcctgtc 
2940 



tgctctgttc ccaaaggcag cgtgtgtggc ctcgggccct gcggtggcat gaagcatccc 
3000 



ttctggtgtg ggcatcgcta cgtgttttgg gggcagcgtt tcacggcggt gcccttgctg 
3060 



tctcccttgg gctggctcga gcctggggtc catgtccctt tgccgtcccg tcatggggca 
3120 



gggaatccat agcggggccc acaggcaggg gtatgagtgc gtcccaccca acgcagcacc 
3180 

agccccggcc accgctcccc gtgtccccag ttccgtctca gctacctgga ctccaggacc 
3240 

ctggagaagg gagacctggc agtggaggga ggctgtgctg tgtgtccccc tgcaggtgtg 
3300 

accccgcctg ctctttcctc ccccgccagg tgtggccccg cctgctcttt cctcccccac 
3360 

cagtatggcc ccacctgctc tttcctcccc ccccaaggtg tggccccacc tgttctttcc 
3420 

tcccctgccg aggtgtgacc ccacctgctc tttcctccct cccagtatgg ccccacctgc 
3480 

tctttcctcc cccgaggtga ggccccgcct gctctttcct cccatgggag ccgctgaggc 
3540 

gtgcgcacct gggcacaggt tggggctctg caggatgagg aagacaggcc aatcccttcc 
3600 

ctcccagaag ctggccgccc agcaggaggg actgaggcca gactcatgtc cagcaaggaa 
3660 

cgtgtggtgt gtcccctggg aagtctctgg gccctgggaa gagggaaggt gcacgtcctg 
3720 

ggatggttgc ggggccctgt tttgggagac aaaggggtag agggtctgtc ttgggccccc 
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3780 



ccagactcta gcccgagcag tgcagccacc tactgcccca cctcagagaa gtgcagcggg 
3840 



aaggaggctg gaggtggtgc ggcgctgcct cgggtgtctg cgtgaatgag cgtggccaag 
3900 



gaccagtgcc acctcatggc aaagagctcc cgcagtgttt gttagagtgc acatcctacg 
3960 



tgcccactgg cacacacacg tgctcacata catgtccgcg tacaggcgta cacatgcacg 
4020 



cttgcacaca tgcacacaga ccacatagca cacatgtgca ctgaccacac ctgtatagac 
4080 

catgcacagt acacatacgt gcatacacat gcctgcatac aggcatacac atgcacgctt 
4140 



acatgtacac gtgcacagat cacacacatg cacacacgtg tagctcacac acagtataca 
4200 

catacacaag tgcacagacc acacacagca ctaacacatg cacacacaaa gtgcataggc 
4260 

cacacagcac atgcacacag gtgcacagac cacacagcac acacaagtgc acagagcaca 
4320 

ctgcacacat gcacacacac acgcgtgcat gcacactcct cgcacttcca gccttggagc 
4380 

ccttctgtct ctggtctttc tctttgaccc tgctgagtgt aagctgcctg gggaggggct 
4440 

acaaggagta attgtggctt taggggtcgt ggtgatgctg gaatgtcaag cgccgtcgtg 
4500 

gggtatccga ctgtccgggc tcctggtccg cagtggcaga gcgccaggca gagccaatca 
4560 

gggtctcgtg ctgcccttcc cccccacagc ctggcagcca tccagaggag gggctctacc 
4620 

agatgccaag gtgccccggt gtctgtatgg gtgtccggtt gggtcctgtg tttggtctgc 
4680 

cctggaggtg gctgggccct cctgggatgg gtggctcagc ctcgaatccc aggccccagc 
4740 

ccaggcaggt gctgctgcct gttgtggttt cctggcccag cttctccttc tccctctgca 
4800 

taaaatcaca gtccgtgagt cttccagctg ccaccacggc tgggacacgc tgggggaggg 
4860 
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ctcctcccat gcctcctgca cacagccgtc tgagcagggc aggtgccaac accccccacc 
4920 

ggagacacgc tgcccctcag cgatgcccct accttttggg gggcctcgtc tcaagccccc 
4980 

ccttggaggc tgaaatcacc ccaggcactg tgagggcttc tccaggggga caccctttga 
5040 

gctgtgggtc tgatcacccc aagtcccgca cacggaggag aggcacagcc agggcgtgtg 
5100 

gtttaatgtt tgccccttcg gggctggagg tctcagtgtt tctagattcc agaccctgct 
5160 

gccagagaga cctgctgccg gagagaaggg gaggaggact ccagctgggc tcggtccccc 
5220 

acagtcaggg acccccataa aggacacccc cttctctcta gaaagagctg ggctctcagc 
5280 

tatttctagt tgcttcccag aagccgagga gcagaaggag ctgtgagagc tttgcagaaa 
5340 

cgcccttgtc cccgccctcc tgagctatga atgccgtaca gagcagaggc tggggcattg 
5400 

gcaagatcac aggttgatgc tgcacagccc cattgacaca aaccctcaaa gcagacgtga 
5460 

gagggacggt tcacaaagct tggacctgcc gtggagggtg cccggcagac gtggcgtgag 
5520 

agggacggct cacgaggctt ggacctgctg tggagggtgc ccagcagacg tggtgtgaga 
5580 

ggaacggctc acgagacttg gacctggtgg agggtgccca gcagacgtgg tgtgagaggg 
5640 

acggctcaca gggcttggac cggagagaga tggctcatga gacttggacc tgccgtggag 
5700 

ggtgcccagc agacgtggta tgagagggat ggctcacgag gcttggacct ggtggagggt 
5760 

gcccggcaga cgtgtgagag ggacggttca caaggcttgg acctgccatg gagggtgccc 
5820 

agcagacgtg gtgtgagagg gacagctcac gaggcttgga cctgccgtgg agggtgccca 
5880 

gcagggggct gagctctgag gggtgggtgc tcagtgcacg ggtgccccca gtgtcctctg 
5940 
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atcctgtccg gtgcctcccc caacccccac acccatgcag aactcccagg tcacatgcac 
6000 

gtatgtccag ggcatggggg tggcgtgaag aggcctggtc agggccttta ggggctgcag 
6060 

gacggaatgg ccacctgggg agcctgtgtg gctgtgccgg gcagccatcc tgcattccca 
6120 

cccagcgcgc agtctccacc tcggccccag caaagcgcta agcagccgga gagacagcca 
6180 

gggcggcttc ctgaaggatg tgggatggtg gactccgggg tcgagggaat acgcaggttc 
6240 

ctgtcctccg ggagacctag agaagctgca cacccaggag ctttccatga cccgggagca 
6300 

tgagtgaatg gggggttcca gtttgctgaa ctttgctgtc ttgtaagggt gggggctgac 
6360 

ggccgaccct gggaggaggt gacaccgcag ggggaggttg tgggcaacgg tggaggagga 
6420 

gagacgggag gggaccattt gggatggagg ggcctcttca gagttttaaa aggcgtttgt 
6480 

ggggtggagt tgagtgtgct ctgggcttgg acacttgccg tggtgcccct ggctggccga 
6540 

ggagactggc tctggccagg gccccgtcct gagaggtcct cagcgtctga ctctcggcca 
6600 

ggcgccagca aggaggggcc ggtccccggg gctaccaggc aggcacgtgc acatcgccat 
6660 

cgccacacgc caactccgcc tgggttttac aaagtcgttg ccttaatgca tgtggacagg 
6720 

aactccctga ggtcgcccca tgccccctgg ctgtgccagg tacggacgcc ctggaccctg 
6780 

cgaacaggtg gggcgggcga ggggcccaag ggacgggctc cagagacacg cgcagggcag 
6840 

gaggggtctc acggaggggt ctcgcactga ggcgcccaga gctggtggtc ccgctggacg 
6900 

ccatccctct gcccgggatc cacacggccc acgtgtgccc gccatgcccg cgccccacgc 
6960 

cattgcagtc ttccatcctc tggccgtgac ggtggctgca gcttccccat ttgcgccgtt 
7020 

gcctctggct gtctgcactt ttgttcatgc tccaaagaac atttcataat gccttcagta 
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7080 

ccgacgtaca cttctgacca ttttgtatgt gtccttgtgc cgtagtgacc aggccttttt 
7140 

ttggtggatg tgttaccccg cacacttcaa tctcaacttt gtgcaccgtc cattttctaa 
7200 

ggatagacgc ccagggaatg aactctagtt ttctaacaga ttagctgaga tattaactta 
7260 

ctcacacgga caggttgatg ccagagccgt aagaatgcgc cagtgcgggt ttgcggggga 
7320 

cttcgggtgt ggggtcctgc ggccgcgatg gccgtggaag gttctgggga tccctgctgc 
7380 

cacggggacg agttcggacg ccaggtggac ctgtgcactc agtaaaacgc agtgattc 
7438 



<210> 32 

<211> 7437 

<212> DNA 

<213> Homo sapiens 

<400> 32 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccgtggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180 

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 
300 

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360 

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 
480 

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540 

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
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600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660 

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780 

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
840 

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag 
900 

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg 
960 

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag 
1020 

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc 
1080 

tacgccacca acctctcgcg cacagacctg cactccacgt ggcagtacta cgagcgaacg 
1140 

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc 
1200 

ccgctgaacc agctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg 
1260 

aaggaccccc cgccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc 
1320 

tccagccccc gaggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg 
1380 

cggtcaccca gcgccgacca gagcctcgag gacagcccca gcaaggtgcc caagagctgg 
1440 

agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg 
1500 

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 
1560 

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt 
1620 

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg 
1680 

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaagagc 
1740 

ctgcagtcca gagtggacca gatcgtgggg cggggcccag cgatcacgga caaggaccgc 
1800 

accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860 

aaggtggaga agcaggtctt gtccatggag aagaagctgg acttcctggt gaatatctac 
1920 

atgcagcgga tgggcatccc cccgacagag accgaggcct actttggggc caaagagccg 
1980 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc 
2040 

tgcattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc 
2100 

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg 
2160 

cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220 

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280 

ttcctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc 
2340 

gacacgtcca tctccatccc gtccgtggac cacgaggagc tggagcgttc cttcagcggc 
2400 

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg 
2460 

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac 
2520 

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580 

gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640 

ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 
2700 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 
2760 
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gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820 

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 
2880 

ccaaaaaggt gccaactggg aggcctcgga agccactgtc caggctccca ctgcctgtct 
2940 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct 
3000 

tctggtgtgg gcatcgctac gtgttttggg ggcagcgttt cacggcggtg cccttgctgt 
3060 

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag 
3120 

ggaatccata gcggggccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca 
3180 

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc 
3240 

tggagaaggg agacctggca gtggagggag gctgtgctgt gtgtccccct gcaggtgtga 
3300 

ccccgcctgc tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc 
3360 

agtatggccc cacctgctct ttcctccccc cccaaggtgt ggccccacct gttctttcct 
3420 

cccctgccga ggtgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct 
3480 

ctttcctccc ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg 
3540 

tgcgcacctg ggcacaggtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600 

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 
3660 

gtgtggtgtg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg 
3720 

gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc 
3780 

cagactctag cccgagcagt gcagccacct actgccccac ctcagagaag tgcagcggga 
3840 

aggaggctgg aggtggtgcg gcgctgcctc gggtgtctgc gtgaatgagc gtggccaagg 
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3900 

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960 

gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 
4020 

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc 
4080 

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta 
4140 

catgtacacg tgcacagatc acacacatgc acacacgtgt agctcacaca cagtatacac 
4200 

atacacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc 
4260 

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac 
4320 

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggagcc 
4380 

cttctgtctc tggtctttct ctttgaccct gctgagtgta agctgcctgg ggaggggcta 
4440 

caaggagtaa ttgtggcttt aggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg 
4500 

ggtatccgac tgtccgggct cctggtccgc agtggcagag cgccaggcag agccaatcag 
4560 

ggtctcgtgc tgcccttccc ccccacagcc tggcagccat ccagaggagg ggctctacca 
4620 

gatgccaagg tgccccggtg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc 
4680 

ctggaggtgg ctgggccctc ctgggatggg tggctcagcc tcgaatccca ggccccagcc 
4740 

caggcaggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat 
4800 

aaaatcacag tccgtgagtc ttccagctgc caccacggct gggacacgct gggggagggc 
4860 

tcctcccatg cctcctgcac acagccgtct gagcagggca ggtgccaaca ccccccaccg 
4920 

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc 
4980 

cttggaggct gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag 
5040 

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg 
5100 

tttaatgttt gccccttcgg ggctggaggt ctcagtgttt ctagattcca gaccctgctg 
5160 

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca 
5220 

cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 
5280 

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 
5340 

gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg 
5400 

caagatcaca ggttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag 
5460 

agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga 
5520 

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 

gaacggctca cgagacttgg acctggtgga gggtgcccag cagacgtggt gtgagaggga 
5640 

cggctcacag ggcttggacc ggagagagat ggctcatgag acttggacct gccgtggagg 
5700 

gtgcccagca gacgtggtat gagagggatg gctcacgagg cttggacctg gtggagggtg 
5760 

cccggcagac gtgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca 
5820 

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 
5940 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg 
6000 

tatgtccagg gcatgggggt ggcgtgaaga ggcctggtca gggcctttag gggctgcagg 
6060 

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 

ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc 
6240 

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300 

gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 
6360 

gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag 
6420 

agacgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 
6480 

gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540 

gagactggct ctggccaggg ccccgtcctg agaggtcctc agcgtctgac tctcggccag 
6600 

gcgccagcaa ggaggggccg gtccccgggg ctaccaggca ggcacgtgca catcgccatc 
6660 

gccacacgcc aactccgcct gggttttaca aagtcgttgc cttaatgcat gtggacagga 
6720 

actccctgag gtcgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 
6780 

gaacaggtgg ggcgggcgag gggcccaagg gacgggctcc agagacacgc gcagggcagg 
6840 

aggggtctca cggaggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
6900 

catccctctg cccgggatcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
6960 

attgcagtct tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg 
7020 

cctctggctg tctgcacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 
7080 

cgacgtacac ttctgaccat tttgtatgtg tccttgtgcc gtagtgacca ggcctttttt 
7140 

tggtggatgt gttaccccgc acacttcaat ctcaactttg tgcaccgtcc attttctagg 
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7200 

gatagacgcc cagggaatga actctagttt tctaacagat tagctgagat attaacttac 
7260 

tcacacggac aggttgatgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac 
7320 

ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380 

acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
7437 



<210> 33 

<211> 7437 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccacggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180 

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 
300 

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360 

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 
480 

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540 

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660 

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
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720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780 

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
840 

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag 
900 

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg 
960 

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag 
1020 

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc 
1080 

tacgccacca acctctcgcg cacagacctg cactccacgt ggcagtacta cgagcgaacg 
1140 

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc 
1200 

ccgctgaacc agctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg 
1260 

aaggaccccc cgccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc 
1320 

tccagccccc gaggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg 
1380 

cggtcaccca gcgccgacca gagcctcgag gacagcccca gcaaggtgcc caagagctgg 
1440 

agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg 
1500 

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 
1560 

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt 
1620 

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg 
1680 

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaagagc 
1740 

ctgcagtcca gagtggacca gatcgtgggg cggggcccag cgatcacgga caaggaccgc 
1800 
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accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860 

aaggtggaga agcaggtctt gtccatggag aagaagctgg acttcctggt gaatatctac 
1920 

atgcagcgga tgggcatccc cccgacagag accgaggcct actttggggc caaagagccg 
1980 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc 
2040 

tgcattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc 

2100

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg 
2160 

cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280 

ttcctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc 
2340

gacacgtcca tctccatccc gtccgtggac cacgaggagc tggagcgttc cttcagcggc
2400 

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg
2460

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac 
2520 

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580

gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640

ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 
2700 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 

2760

gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820 

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 
2880
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C 2940 aa99t gccaacCggg aggcctcgga agccactgtc caggctccca ctgcctgtct 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct
3000

tctggtgtgg gcatcgctac gtgttttggg ggcagcgttt cacggcggtg cccttgctgt
3060

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag
3120

ggaatccata gcggggccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca
3180

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc
3240

tggagaaggg agacctggca gtggagggag gctgtgctgt gtgtccccct gcaggtgtga
3300

ccccgcctgc tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc 
3360

agtatggccc cacctgctct ttcctccccc cccaaggtgt ggccccacct gttctttcct 
3420 

cccctgccga ggtgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct
3480 

ctttcctccc ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg 
3540 

tgcgcacctg ggcacaggtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600 

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 
3660

g wn gt9tg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg 

gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc 
3780 

cagactctag cccgagcagt gcagccacct actgccccac ctcagagaag tgcagcggga 
3840 

aggaggctgg aggtggtgcg gcgctgcctc gggtgtctgc gtgaatgagc gtggccaagg 
3900 

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960 

gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 
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402 0 

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc
4080

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta
4140 

C ^^H acacg t ST ca cagatc acacacatgc acacacgtgt agctcacaca cagtatacac 
4200 

atacacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc 
4260 

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac 
4320 

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggaacc 
4380

cttctgtctc tggtctttct ctttgaccct gctgagtgta agctgcctgg ggaggggcta 
4440

caaggagtaa ttgtggcttt aggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg
4500

ggtatccgac tgtccgggct cctggtccgc agtggcagag cgccaggcag agccaatcag 
4560

ggtctcgtgc tgcccttccc ccccacagcc tggcagccat ccagaggagg ggctctacca 
4620 

gatgccaagg tgccccggtg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc 
4680 

ctggaggtgg ctgggccctc ctgggatggg tggctcagcc tcgaatccca ggccccagcc 
4740 

caggcaggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat 
4800

aaaatcacag tccgtgagtc ttccagctgc caccacggct gggacacgct gggggagggc 
4860 

tcctcccatg cctcctgcac acagccgtct gagcagggca ggtgccaaca ccccccaccg
4920

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc 
4980 

cttggaggct gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag 
5040

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg 
5100 
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t 5160 tgttt gccccttcgg 99<=tggaggt ctcagtgttt ctagattcca gaccctgctg 

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca 
5220 

cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 
5280 

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 
5340

gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg
5400

caagatcaca ggttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag 
5460 

agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga 
5520 

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 

gaacggctca cgagacttgg acctggtgga gggtgcccag cagacgtggt gtgagaggga 
5640 

cggctcacag ggcttggacc ggagagagat ggctcatgag acttggacct gccgtggagg
5700 

gtgcccagca gacgtggtat gagagggatg gctcacgagg cttggacctg gtggagggtg 
5760 

cccggcagac gtgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca 
5820 

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 
5940 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg 
6000 

tatgtccagg gcatgggggt ggcgtgaaga ggcctggtca gggcctttag gggctgcagg 
6060 

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 
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ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc 
6240

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300 

gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 

6360

gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag 
6420 

agacgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 

6480

gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540

gagactggct ctggccaggg ccccgtcctg agaggtcctc agcgtctgac tctcggccag 
6600 

gcgccagcaa ggaggggccg gtccccgggg ctaccaggca ggcacgtgca catcgccatc 
6660 

gccacacgcc aactccgcct gggttttaca aagtcgttgc cttaatgcat gtggacagga 
6720 

actccctgag gtcgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 
6780 

gaacaggtgg ggcgggcgag gggcccaagg gacgggctcc agagacacgc gcagggcagg 
6840 

aggggtctca cggaggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
6900 

catccctctg cccgggatcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
6960 

attgcagtct tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg 
7020 

cctctggctg tctgcacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 
7080 

cgacgtacac ttctgaccat tttgtatgtg tccttgtgcc gtagtgacca ggcctttttt 
7140 

tggtggatgt gttaccccgc acacttcaat ctcaactttg tgcaccgtcc attttctagg 
7200 

gatagacgcc cagggaatga actctagttt tctaacagat tagctgagat attaacttac 
7260 

tcacacggac aggttgatgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac 
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7320 

ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380 

acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
7437 



<210> 34 

<211> 7437 

<212> DNA 

<213> Homo sapiens 

<400> 34

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccatggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180 

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 
300 

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360 

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 
480 

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540 

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660 

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780 

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
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840 

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag
900

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg
960

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag
1020

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc
1080

ta ??^ Cacca acctctc gg9 cacagacctg cactccacgt ggcagtacta cgagcgaacg 
1140 

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc 
1200

ccgctgaacc agctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg 
1260 

aaggaccccc cgccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc 
1320

tccagccccc gaggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg 

1380

cggtcaccca gcgccgacca gagcctcgag gacagcccca gcaaggtgcc caagagctgg 

1440

agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg 
1500

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 
1560 

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt
1620

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg
1680

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaagagc
1740

ctgcagtcca gagtggacca gatcgtgggg cggggcccag cgatcacgga caaggaccgc
1800

accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860 

aaggtggaga agcaggtctt gtccatggag aagaagctgg acttcctggt gaatatctac 
1920 
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atgcagcgga tgggcatccc cccgacagag accgaggcct actttggggc caaagaqcca 
1980 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc 
2040 

tgcattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc
2100

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg
2160

cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220 

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280 

ttcctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc 
2340 

gacacgtcca tctccatccc gtccgtggac cacgaggagc tggagcgttc cttcagcggc 
2400 

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg 
2460 

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac 
2520 

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580 

gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640 

ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 
2700 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 
2760 

gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820 

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 
2880 

ccaaaaaggt gccaactggg aggcctcgga agccactgtc caggctccca ctgcctgtct 
2940 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct 
3000 
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3060 93 gcatcgctac fftgttttggg ggcagcgttt cacggcggtg cccttgctgt 

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag
3120

ggaatccata gcggggccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca
3180

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc
3240

tggagaaggg agacctggca gtggagggag gctgtgctgt gtgtccccct gcaggtgtga
3300

ccccgcctgc tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc 
3360

agtatggccc cacctgctct ttcctccccc cccaaggtgt ggccccacct gttctttcct 
3420

cccctgccga ggtgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct 
3480

ctttcctccc ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg 
3540 

tgcgcacctg ggcacaggtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 
3660

gtgcggtgtg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg
3720 

gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc
3780

cagactctag cccgagcagt gcagccacct actgccccac ctcagagaag tgcagcggga
3840

aggaggctgg aggtggtgcg gcgctgcctc gggtgtctgc gtgaatgagc gtggccaagg 
3900

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960 

gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 
4020 

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc 
4080 

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta 
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r 


cacgtacacg tgcacagatc acacacatgc acacacgtgt agctcacaca cagtatacac
4200

atacacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc
4260

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac
4320

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggagcc
4380

cttctgtctc tggtctttct ctttgaccct gctgagtgta agctgcctgg ggaggggcta
4440

caaggagtaa ttgtggcttt aggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg
4500

ggtatccgac tgtccgggct cctggtccgc agtggcagag cgccaggcag agccaatcag
4560

ggtctcgtgc tgcccttccc ccccacagcc tggcagccat ccagaggagg ggctctacca
4620

gatgccaagg tgccccggtg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc
4680

ctggaggtgg ctgggccctc ctgggatggg tggctcagcc tcgaatccca ggccccagcc
4740

caggcaggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat
4800

aaaatcacag tccgtgagtc ttccagctgc caccacggct gggacacgct gggggagggc
4860

tcctcccatg cctcctgcac acagccgtct gagcagggca ggtgccaaca ccccccaccg
4920

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc
4980

cttggaggct gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag
5040

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg
5100

tttaatgttt gccccttcgg ggctggaggt ctcagtgttt ctagattcca gaccctgctg
5160

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca
5220
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tcgatgtgga tgagaagaac caaatgatga ccaccaacgt ctggctaaaa caggagtgga 
900

gcgactacaa actgcgctgg aaccccgctg attttggcaa catcacatct ctcagggtcc 

c ^tctgagat gatctggatc cccgacattg ttctctacaa caatgcagat ggggagtttg 

cagtgaccca catgaccaag gcccacctct tctccacggg cactgtgcac tgggtgcccc
1080 

cggccatcta caagagctcc tgcagcatcg acgtcacctt cttccccttc gaccagcaga 
1140 

actgcaagat gaagtttggc tcctggactt atgacaaggc caagatcgac ctggagcaga 
1200

tggagcagac tgtggacctg aaggactact gggagagcgg cgagtgggcc atcgtcaatg 
1260

ccacgggcac ctacaacagc aagaagtacg actgctgcgc cgagatctac cccgacgtca 
1320 

cctatgcctt cgtcatccgg cggctgccgc tcttctacac catcaacctc atcatcccct 
1380 

gcctgctcat ctcctgcctc actgtgctgg tcttctacct gccctccgac tgcggcgaga 
1440 

agatcacgct gtgcatttcg gtgctgctgt cactcaccgt cttcctgctg ctcatcactg 
1500 

agatcatccc gtccacctcg ctggtcatcc cgctcatcgg cgagtacctg ctgttcacca 
1560 

tgatcttcgt caccctgtcc atcgtcatca ccgtcttcgt gctcaatgtg caccaccgct 
1620 

cccccagcac ccacaccatg ccccactggg tgcggggggc ccttctgggc tgtgtgcccc 
1680 

ggtggcttct gatgaaccgg cccccaccac ccgtggagct ctgccacccc ctacgcctga 
1740 

agctcagccc ctcttatcac tggctggaga gcaacgtgga tgccgaggag agggaggtgg 
1800 

tggtggagga ggaggacaga tgggcatgtg caggtcatgt ggccccctct gtgggcaccc 
1860 

tctgcagcca cggccacctg cactctgggg cctcaggtcc caaggctgag gctctgctgc 
1920 
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aggagggtga gctgctgcta tcaccccaca tgcagaaggc actggaaggt gtgcactaca 

ttgccgacca cctgcggtct gaggatgctg actcttcggt gaaggaggac tggaagtatg 
2040 

ttgccatggt catcgacagg atcttcctct ggctgtttat catcgtctgc ttcctgggga 
2100 

ccatcggcct ctttctgcct ccgttcctag ctggaatgat ctgactgcac ctccctcgag 
2160 

ctggctccca gggcaaaggg gagggttctt ggatgtggaa gggctttgaa caatgtttag 
2220 

atttggagat gagcccaaag tgccagggag aacagccagg tgaggtggga ggttggagag 
2280 

ccaggtgagg tctctctaag tcaggctggg gttgaagttt ggagtctgtc cgagtttgca 
2340 

gggtgctgag ctgtatggtc cagcagggga gtaataaggg ctcttccgga aggggaggaa 
2400 

gcgggaggca ggcctgcacc tgatgtggag gtacaggcag atcttcccta ccggggaggg 
2460 

atggatggtt ggatacaggt ggctgggcta ttccatccat ctggaagcac atttgagcct 
2520 

ccaggcttct ccttgacgtc attcctctcc ttccttgctg caaaatggct ctgcaccagc 
2580 

cggcccccag gaggtctggc agagctgaga gccatggcct gcaggggctc catatgtccc 
2640 

tacgcgtgca gcaggcaaac aaga 
2664 

<210> 25 

<211> 3020 

<212> DNA 

<213> Homo sapiens 

<400> 25 

gtcctcccgc gggtccgagg gcgctggaaa cccagcggcg gcgaagcgga gaggagcccc 
60 

gcgcgtctcc gcccgcacgg ctccaggtct ggggtctgcg ctggagccgc gcggggagag 
120 

gccgtctctg cgaccgccgc gcccgctccc gaccgtccgg gtccgcggcc agcccggcca 
180 
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cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 
5280 

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 
5340

gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg 
5400

caagatcaca ggttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag 
5460 

agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga 
5520 

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 

gaacggctca cgagacttgg acctggtgga gggtgcccag cagacgtggt gtgagaggga 
5640 

cggctcacag ggcttggacc ggagagagat ggctcatgag acttggacct gccgtggagg 
5700

gtgcccagca gacgtggtat gagagggatg gctcacgagg cttggacctg gtggagggtg 
5760

cccggcagac gtgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca 
5820 

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 
5940 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg 
6000 

tatgtccagg gcatgggggt ggcgtgaaga ggcctggtca gggcctttag gggctgcagg 
6060 

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 

ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc
6240 

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300
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gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 
6360

gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag
6420

agacgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 
6480 

gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540

gagactggct ctggccaggg ccccgtcctg agaggtcctc agcgtctgac tctcggccag
6600

gcgccagcaa ggaggggccg gtccccgggg ctaccaggca ggcacgtgca catcgccatc 
6660 

gccacacgcc aactccgcct gggttttaca aagtcgttgc cttaatgcat gtggacagga
6720 

actccctgag gtcgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 
6780

gaacaggtgg ggcgggcgag gggcccaagg gacgggctcc agagacacgc gcagggcagg 
6840 

aggggtctca cggaggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
6900 

catccctctg cccgggatcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
6960 

attgcagtct tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg
7020

cctctggctg tctgcacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 
7080 

cgacgtacac ttctgaccat tttgtatgtg tccttgtgcc gtagtgacca ggcctttttt 
7140 

tggtggatgt gttaccccgc acacttcaat ctcaactttg tgcaccgtcc attttctagg 
7200 

gatagacgcc cagggaatga actctagttt tctaacagat tagctgagat attaacttac 
7260 

tcacacggac aggttgatgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac 
7320 

ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380 

acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
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7437 

<210> 35 

<211> 7437 

<212> DNA 

<213> Homo sapiens 

<400> 35 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccatggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 

300

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360 

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 

480

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
840

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag
900

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg 
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960 

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag
1020

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc 
1080

tacgccacca acctctcgcg cacagacctg cactccacgt ggcagtacta cgagcgaacg
1140

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc 
1200 

ccgctgaacc agctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg
1260 

aaggaccccc cgccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc 
1320 

tccagcccct gaggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg 
1380 

cggtcaccca gcgccgacca gagcctcgag gacagcccca gcaaggtgcc caagagctgg 
1440 

agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg 
1500

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 
1560 

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt 
1620 

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg 
1680 

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaagagc 
1740 

ctgcagtcca gagtggacca gatcgtgggg cggggcccag cgatcacgga caaggaccgc 
1800 

accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860 

aaggtggaga agcaggtctt gtccatggag aagaagctgg acttcctggt gaatatctac 
1920 

atgcagcgga tgggcatccc cccgacagag accgaggcct actttggggc caaagagccg 
1980 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc 
2040 
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t |cattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc 

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg
2160 

cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220 

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280 

ttcctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc
2340

gacacgtcca tctccatccc gtccgtggac cacgaggagc tggagcgttc cttcagcggc 
2400

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg 
2460 

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac 
2520 

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580 

gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640 

ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 
2700 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 
2760 

gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820 

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 
2880 

ccaaaaaggt gccaactggg aggcctcgga agccactgtc caggctccca ctgcctgtct 
2940 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct 
3000 

tctggtgtgg gcatcgctac gtgttttggg ggcagcgttt cacggcggtg cccttgctgt 
3060 

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag 
3120 
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9 ??on CCata gc ^^ ccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca 
3180 

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc 
3240

tggagaaggg agacctggca gtggagggag gctgtgctgt gtgtccccct gcaggtgtga 
3300

ccccgcctgc tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc 
3360 

agtatggccc cacctgctct ttcctccccc cccaaggtgt ggccccacct gttctttcct 
3420

cccctgccga ggtgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct 
3480 

ctttcctccc ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg
3540

tgcgcacctg ggcacaggtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600 

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 
3660

gtgtggtgtg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg 
3720 

gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc 
3780 

cagactctag cccgagcagt gcagccacct actgccccac ctcagagaag tgcagcggga 
3840

aggaggctgg aggtggtgcg gcgctgcctc gggtgtctgc gtgaatgagc gtggccaagg 
3900 

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960 

gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 
4020 

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc 
4080 

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta 
4140 

catgtacacg tgcacagatc acacacatgc acacacgtgt agctcacaca cagtatacac 
4200 

atacacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc 
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4260 

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac 
4320 

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggagcc 
4380 

cttctgtctc tggtctttct ctttgaccct gctgagtgta agctgcctgg ggaggggcta 
4440 

caaggagtaa ttgtggcttt aggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg 
4500

ggtatccgac tgtccgggct cctggtccgc agtggcagag cgccaggcag agccaatcag 
4560 

ggtctcgtgc tgcccttccc ccccacagcc tggcagccat ccagaggagg ggctctacca 
4620 

gatgccaagg tgccccggtg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc
4680

ctggaggtgg ctgggccctc ctgggatggg tggctcagcc tcgaatccca ggccccagcc 
4740 

caggcaggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat 
4800 

aaaatcacag tccgtgagtc ttccagctgc caccacggct gggacacgct gggggagggc 
4860 

tcctcccatg cctcctgcac acagccgtct gagcagggca ggtgccaaca ccccccaccg 
4920 

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc 
4980 

cttggaggct gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag 
5040

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg 
5100 

tttaatgttt gccccttcgg ggctggaggt ctcagtgttt ctagattcca gaccctgctg 
5160 

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca 
5220 

cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 
5280 

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 
5340 
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gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg 
5400 

caagatcaca ggttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag
5460

agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga
5520

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 

gaacggctca cgagacttgg acctggtgga gggtgcccag cagacgtggt gtgagaggga 
5640

cggctcacag ggcttggacc ggagagagat ggctcatgag acttggacct gccgtggagg 
5700

gtgcccagca gacgtggtat gagagggatg gctcacgagg cttggacctg gtggagggtg 
5760 

cccggcagac gtgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca
5820 

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 
5940 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg 
6000 

tatgtccagg gcatgggggt ggcgtgaaga ggcctggtca gggcctttag gggctgcagg 
6060

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 

ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc 
6240

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300 

gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 
6360 

gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag 
6420 
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agacgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 
6480 



gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540 



gagactggct ctggccaggg ccccgtcctg agaggtcctc agcgtctgac tctcggccag 
6600



gcgccagcaa ggaggggccg gtccccgggg ctaccaggca ggcacgtgca catcgccatc 
6660 



gccacacgcc aactccgcct gggttttaca aagtcgttgc cttaatgcat gtggacagga 
6720 



actccctgag gtcgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 
6780



gaacaggtgg ggcgggcgag gggcccaagg gacgggctcc agagacacgc gcagggcagg 
6840



aggggtctca cggaggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
6900 



catccctctg cccgggatcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
6960 



attgcagtct tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg 
7020 



cctctggctg tctgcacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 
7080



cgacgtacac ttctgaccat tttgtatgtg tccttgtgcc gtagtgacca ggcctttttt 
7140 



tggtggatgt gttaccccgc acacttcaat ctcaactttg tgcaccgtcc attttctagg 
7200 



gatagacgcc cagggaatga actctagttt tctaacagat tagctgagat attaacttac 
7260 



tcacacggac aggttgatgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac 
7320 



ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380 



acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
7437 



<210> 36 
<211> 7437 
<212> DNA 
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<213> Homo sapiens 

<400> 36 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccatggtgca gaagtcgcgc 
60 

aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg 
120 

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag
180 

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 
300 

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360 

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 
480 

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540 

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660 

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780 

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
840 

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag 
900 

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg 
960 

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag 
1020 

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc 

1080 

tacgccacca acctctcgcg cacagacctg cactccacgt ggcagtacta cgagcgaacg
1140

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc
1200

ccgctgaacc agctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg
1260

aaggaccccc cgccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc
1320

tccagccccc gaggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg
1380

cggtcaccca gcgccgacca gagcctcgag gacagcccca gcaaggtgcc caagagctgg
1440

agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg
1500

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 

1560

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt 
1620

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg 
1680 

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaagagc
1740

ctgcagtcca gagtggacca gatcgtgggg cggggcccag cgatcacgga caaggaccgc
1800

accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860

aaggtggaga agcaggtctt gtccatggag aagaagctgg acttcctggt gaatatctac 
1920 

atgcagcgga tgggcatccc cccgacagag accgaggcct actttggggc caaagagccg 
1980 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc 
2040

tgcattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc 
2100 

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg 
2160 
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cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220 

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280 

ttcctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc 

2340

9|^acgtcca tctccatccc gtccgtggac cacgaggagc tggagcgttc cttcagcggc 

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg 

2460

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac
2520

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580 


gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640


ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 
2700 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 
2760 

gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 
2880 

ccaaaaaggt gccaactggg aggcctcgga agccactgtc caggctccca ctgcctgtct 
2940 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct 
3000 

tctggtgtgg gcatcgctac gtgttttggg ggcagcgttt cacggcggtg cccttgctgt 
3060 

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag 
3120 

ggaatccata gcggggccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca 
3180 

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc 
3240 
ffo^^ 9 agacctggca STt^agggag gctgtgctgt gtgtccccct gcaggtgtga

C ?^o CCtgC tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc 
3360

a ?!lon ggCCC cacctgctct ttcctccccc cccaaggtgt ggccccacct gttctttcct 

3420

C ?^on gCC9a Sgtgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct 
3480

ctttcctccc ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg 
3540 

tgcgcacctg ggcacaggtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 

3660

gtgtggtgtg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg 

gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc 
3780 

Ca | a Q tCtag ccc Sagcagt gcagccacct actgccccac ctcagagaag tgcagcggga 

aggaggctgg aggtggtgcg gcgctgcctc gggtgtctgc gtgaatgagc gcggccaagg
3900 

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960


gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 

4020

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc 
4080

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta
4140

catgtacacg tgcacagatc acacacatgc acacacgtgt agctcacaca cagtatacac 
4200 

atacacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc 
4260 

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac 
4320 

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggagcc 

4380

°4440 9tCtC tggtctttct c tttgaccct gctgagtgta agctgcctgg ggaggggcta 
°4500 a9taa ttgtggcttt a ggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg 
9 4560 CC9aC tgtCC9ggct cct <ratccgc agtggcagag cgccaggcag agccaatcag 
gg B^ CgtgC ^^tccc ccccacagcc tggcagccat ccagaggagg ggctctacca 

g ^?n Caa " t ^ cccc ^9tg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc 
4680

C 4740 9gtgg Ctgggccctc ct ^^tggg tggctcagcc tcgaatccca ggccccagcc 
c ^aggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat 
a ^o?o Cacag tcc< 3tgagtc ttccagctgc caccacggct gggacacgct gggggagggc 

4860 

t 4Q9n CCatg cctcct ^ cac acagccgtct gagcagggca ggtgccaaca ccccccaccg 

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc 
4980 

cttggaggct gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag 
5040 

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg 
5100 

tttaatgttt gccccttcgg ggctggaggt ctcagtgttt ctagattcca gaccctgctg 
5160 

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca 
5220

cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 

5280

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 

5340

gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg 
5400 

C ^?^ tCaCa ^gttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag 
5460
agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga
5520 

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 

gaacggctca cgagacttgg acctggtgga gggtgcccag cagacgtggt gtgagaggga 
5640 

cggctcacag ggcttggacc ggagagagat ggctcatgag acttggacct gccgtggagg 
5700 

gtgcccagca gacgtggtat gagagggatg gctcacgagg cttggacctg gtggagggtg 
5760

cccggcagac gtgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca 
5820

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg 
6000

tatgtccagg gcatgggggt ggcgtgaaga ggcctggtca gggcctttag gggctgcagg 
6060

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 

ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc 
6240 

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300 

gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 
6360

gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag 
6420 

agacgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 
6480 

gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540 

g 6GO0 C " Ct Ctggccaggg cc ccgtcctg agaggtcctc agcgtctgac tctcggccag 

9 6660 a9 ° aa ggaggggccg Stccccgggg ctaccaggca ggcacgtgca catcgccatc 

g 6720 aC9 ° C aactccgcct gggttttaca aagtcgttgc cttaatgcat gtggacagga 

a 6780 CtgaS 9tcgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 

9 tQ^n 9gtgg ggCg9gcgag aggcccaagg gacgggctcc agagacacgc gcagggcagg 
6840

a f?n? tctCa c 99aggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
6900 

C f«^ CtCtg ccc ^9 atcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
6960 

a ^n?n agtCt tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg 

cctctggctg tctgcacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 
7080 

cgacgtacac ttctgaccat tttgtatgtg tccttgtgcc gtagtgacca ggcctttttt 
7140 

t 9|tggatgt gttaccccgc acacttcaat ctcaactttg tgcaccgtcc attttctagg 

gatagacgcc cagggaatga actctagttt tctaacagat tagctgagat attaacttac 
7260 

tcacacggac aggttgatgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac

ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380

acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
7437

<210> 37 

<211> 7437 

<212> DNA 

<213> Homo sapiens 

<400> 37 

gctgagcctg agcccgaccc ggggcgcctc ccgccaggca ccatggtgca gaagtcgcgc 
60 
aacggcggcg tataccccgg cccgagcggg gagaagaagc tgaaggtggg cttcgtgggg

ctggaccccg gcgcgcccga ctccacccgg gacggggcgc tgctgatcgc cggctccgag 
180

gcccccaagc gcggcagcat cctcagcaaa cctcgcgcgg gcggcgcggg cgccgggaag 
240 

ccccccaagc gcaacgcctt ctaccgcaag ctgcagaatt tcctctacaa cgtgctggag 

cggccgcgcg gctgggcgtt catctaccac gcctacgtgt tcctcctggt tttctcctgc 
360

ctcgtgctgt ctgtgttttc caccatcaag gagtatgaga agagctcgga gggggccctc 
420 

tacatcctgg aaatcgtgac tatcgtggtg tttggcgtgg agtacttcgt gcggatctgg 
480

gccgcaggct gctgctgccg gtaccgtggc tggagggggc ggctcaagtt tgcccggaaa 
540 

ccgttctgtg tgattgacat catggtgctc atcgcctcca ttgcggtgct ggccgccggc 
600 

tcccagggca acgtctttgc cacatctgcg ctccggagcc tgcgcttcct gcagattctg 
660 

cggatgatcc gcatggaccg gcggggaggc acctggaagc tgctgggctc tgtggtctat 
720 

gcccacagca aggagctggt cactgcctgg tacatcggct tcctttgtct catcctggcc 
780 

tcgttcctgg tgtacttggc agagaagggg gagaacgacc actttgacac ctacgcggat 
840 

gcactctggt ggggcctgat cacgctgacc accattggct acggggacaa gtacccccag 
900 

acctggaacg gcaggctcct tgcggcaacc ttcaccctca tcggtgtctc cttcttcgcg 
960 

ctgcctgcag gcatcttggg gtctgggttt gccctgaagg ttcaggagca gcacaggcag 
1020

aagcactttg agaagaggcg gaacccggca gcaggcctga tccagtcggc ctggagattc 
1080 

tacgccacca acctctcgcg cacagacctg cactccacgt ggcagtacta cgagcgaacg 
1140 

gtcaccgtgc ccatgtacag ttcgcaaact caaacctacg gggcctccag acttatcccc 
°???n gaacC a 9rctggagct gctgaggaac ctcaagagta aatctggact cgctttcagg

1260

a ?? gacCCCC c 9ccggagcc gtctccaagc cagaaggtca gtttgaaaga tcgtgtcttc 
1320 

t ??oo CCCCC ^aggcgtggc tgccaagggg aaggggtccc cgcaggccca gactgtgagg 

1380

C ?ddn aCC ° a gcgccgacca ?T a 9Cctcgag gacagcccca gcaaggtgcc caagagctgg 
agcttcgggg accgcagccg ggcacgccag gctttccgca tcaagggtgc cgcgtcacgg 

cagaactcag aagaagcaag cctccccgga gaggacattg tggatgacaa gagctgcccc 
1560 

tgcgagtttg tgaccgagga cctgaccccg ggcctcaaag tcagcatcag agccgtgtgt 
1620 

gtcatgcggt tcctggtgtc caagcggaag ttcaaggaga gcctgcggcc ctacgacgtg 
1680 

atggacgtca tcgagcagta ctcagccggc cacctggaca tgctgtcccg aattaacraac 
1740

C 1800 gtCCa gagtggacca 9 atc 9tgggg cggggcccag cgatcacgga caaggaccgc 

accaagggcc cggccgaggc ggagctgccc gaggacccca gcatgatggg acggctcggg 
1860 

aaggtggaga agcaggtctt gtccatggag aagaagcggg acttcctggt gaatatctac 
1920

a i980 gC9ga tgggcatccc cccgacagag sccgaggcct actttggggc caaagagccg 

gagccggcgc cgccgtacca cagcccggaa gacagccggg agcatgtcga caggcacggc
2040

tgcattgtca agatcgtgcg ctccagcagc tccacgggcc agaagaactt ctcggcgccc 
2100 

ccggccgcgc cccctgtcca gtgtccgccc tccacctcct ggcagccaca gagccacccg 

cgccagggcc acggcacctc ccccgtgggg gaccacggct ccctggtgcg catcccgccg 
2220 

ccgcctgccc acgagcggtc gctgtccgcc tacggcgggg gcaaccgcgc cagcatggag 
2280
^ctgcggc aggaggacac cccgggctgc aggccccccg aggggaccct gcgggacagc

g 2400 9tCCa tctccatccc g tcc 9tggac cacgaggagc tggagcgttc cttcagcggc 

ttcagcatct cccagtccaa ggagaacctg gatgctctca acagctgcta cgcggccgtg 
2460 

gcgccttgtg ccaaagtcag gccctacatt gcggagggag agtcagacac cgactccgac 
2520 

ctctgtaccc cgtgcgggcc cccgccacgc tcggccaccg gcgagggtcc ctttggtgac 
2580

gtgggctggg ccgggcccag gaagtgaggc ggcgctgggc cagtggaccc gcccgcggcc 
2640

ctcctcagca cggtgcctcc gaggttttga ggcgggaacc ctctggggcc cttttcttac 

agtaactgag tgtggcggga agggtgggcc ctggaggggc ccatgtgggc tgaaggatgg 
2760 

gggctcctgg cagtgacctt ttacaaaagt tattttccaa cagggcactc ccaggccctg 
2820 

tcgccattga ggtgcctccg ctgggctgtc tcctcacccc tccctgtgct ggagcctgtc 

2880

ccaaaaaggt gccaactggg aggcctcgga agccactgtc caggctccca ctgcctgtct 
2940 

gctctgttcc caaaggcagc gtgtgtggcc tcgggccctg cggtggcatg aagcatccct 
3000 

tctggtgtgg gcatcgctac gtgttttggg ggcagcgttt cacggcggtg cccttgctgt 

3060

ctcccttggg ctggctcgag cctggggtcc atgtcccttt gccgtcccgt catggggcag 
3120 

ggaatccata gcggggccca caggcagggg tatgagtgcg tcccacccaa cgcagcacca 
3180 

gccccggcca ccgctccccg tgtccccagt tccgtctcag ctacctggac tccaggaccc 
3240 

tggagaaggg agacctggca gtggagggag gctgtgctgt gtgtccccct gcaggtgtga 
3300

ccccgcctgc tctttcctcc cccgccaggt gtggccccgc ctgctctttc ctcccccacc
3360 
a 3420 g9CCC cacct9Ctct ttcctccccc cccaaggtgt ggccccacct gttctttcct

C ?^n gCCga 99tgtgaccc cacctgctct ttcctccctc ccagtatggc cccacctgct 

3480

C ^n CtCC ° ccgaggtgag gccccgcctg ctctttcctc ccatgggagc cgctgaggcg 

3540

t ??nn aCCtg 99 caca ^gtt ggggctctgc aggatgagga agacaggcca atcccttccc 
3600

tcccagaagc tggccgccca gcaggaggga ctgaggccag actcatgtcc agcaaggaac 
3660

gtgtggtgtg tcccctggga agtctctggg ccctgggaag agggaaggtg cacgtcctgg
3720


gatggttgcg gggccctgtt ttgggagaca aaggggtaga gggtctgtct tgggcccccc 
3780 

cagactctag cccgagcagt gcagccacct actgccccac ctcagagaag tgcagcggga 
3840 

a ffoo gCtgg aggtggtgcg g c 9Ctgcctc gggtgtctgc gtgaatgagc gtggccaagg 

accagtgcca cctcatggca aagagctccc gcagtgtttg ttagagtgca catcctacgt 
3960

gcccactggc acacacacgt gctcacatac atgtccgcgt acaggcgtac acatgcacgc 
4020

ttgcacacat gcacacagac cacatagcac acatgtgcac tgaccacacc tgtatagacc 
4080 

atgcacagta cacatacgtg catacacatg cctgcataca ggcatacaca tgcacgctta 
4140 

catgtacacg tgcacagatc acacacatgc acacacgtgt agctcacaca cagtatacac 
4200 

a ^acacaagt gcacagacca cacacagcac taacacatgc acacacaaag tgcataggcc 

acacagcaca tgcacacagg tgcacagacc acacagcaca cacaagtgca cagagcacac 
4320

tgcacacatg cacacacaca cgcgtgcatg cacactcctc gcacttccag ccttggagcc 

cttctgtctc tggtctttct ctttgaccct gctgagtgta agctgcctgg ggaggggcta 
4440

caaggagtaa ttgtggcttt aggggtcgtg gtgatgctgg aatgtcaagc gccgtcgtgg 
9 4560 CC9aC tgtccgggct cctggtccgc agtggcagag cgccaggcag agccaatcag
9 ^oo C9tgC fc 9 ccc ttccc ccccacagcc tggcagccat ccagaggagg ggctctacca 

4620

9 ^?n Caagg ^ccccggtg tctgtatggg tgtccggttg ggtcctgtgt ttggtctgcc 

4680

c ^ggaggtgg ctgggccctc ctgggatggg tggctcagcc tcgaatccca ggccccagcc 

caggcaggtg ctgctgcctg ttgtggtttc ctggcccagc ttctccttct ccctctgcat 
4800 

aaaatcacag tccgtgagtc ttccagctgc caccacggct gggacacgct gggggagggc 
4860

t ^^ ccatg cctcctgcac acagccgtct gagcagggca ggtgccaaca ccccccaccg 
4920

gagacacgct gcccctcagc gatgccccta ccttttgggg ggcctcgtct caagcccccc 
4980 

C 5040 aggCt gaaatcaccc caggcactgt gagggcttct ccagggggac accctttgag 

ctgtgggtct gatcacccca agtcccgcac acggaggaga ggcacagcca gggcgtgtgg 
5100 

tttaatgttt gccccttcgg ggctggaggt ctcagtgttt ctagattcca gaccctgctg 
5160

ccagagagac ctgctgccgg agagaagggg aggaggactc cagctgggct cggtccccca 
5220

cagtcaggga cccccataaa ggacaccccc ttctctctag aaagagctgg gctctcagct 
5280 

atttctagtt gcttcccaga agccgaggag cagaaggagc tgtgagagct ttgcagaaac 
5340 

gcccttgtcc ccgccctcct gagctatgaa tgccgtacag agcagaggct ggggcattgg 

caagatcaca ggttgatgct gcacagcccc attgacacaa accctcaaag cagacgtgag 
5460 

agggacggtt cacaaagctt ggacctgccg tggagggtgc ccggcagacg tggcgtgaga 
5520 

gggacggctc acgaggcttg gacctgctgt ggagggtgcc cagcagacgt ggtgtgagag 
5580 
9 5640 gCtCa cgagacttgg a cctggtgga gggtgcccag cagacgtggt gtgagaggga

C R7nn° aCag racttggacc ggagagagat ggctcatgag acttggacct gccgtggagg 
9 5760° agCa gacgtggtat: 9 a 9 a ggg at g gctcacgagg cttggacctg gtggagggtg 

C co?n CagaC Stgtgagagg gacggttcac aaggcttgga cctgccatgg agggtgccca 
5820 

gcagacgtgg tgtgagaggg acagctcacg aggcttggac ctgccgtgga gggtgcccag 
5880 

cagggggctg agctctgagg ggtgggtgct cagtgcacgg gtgcccccag tgtcctctga 
5940 

tcctgtccgg tgcctccccc aacccccaca cccatgcaga actcccaggt cacatgcacg
6000 

ta 060 CCagg gcatgggggt TOCfiTtsraaga ggcccggtca gggcctttag gggctgcagg 

acggaatggc cacctgggga gcctgtgtgg ctgtgccggg cagccatcct gcattcccac 
6120 

ccagcgcgca gtctccacct cggccccagc aaagcgctaa gcagccggag agacagccag 
6180 

ggcggcttcc tgaaggatgt gggatggtgg actccggggt cgagggaata cgcaggttcc 
6240 

tgtcctccgg gagacctaga gaagctgcac acccaggagc tttccatgac ccgggagcat 
6300 

gagtgaatgg ggggttccag tttgctgaac tttgctgtct tgtaagggtg ggggctgacg 
6360 


gccgaccctg ggaggaggtg acaccgcagg gggaggttgt gggcaacggt ggaggaggag 
6420

a | a cgggagg ggaccatttg ggatggaggg gcctcttcag agttttaaaa ggcgtttgtg 

gggtggagtt gagtgtgctc tgggcttgga cacttgccgt ggtgcccctg gctggccgag 
6540 

gagactggct ctggccaggg ccccgtcctg agaggtcctc agcgtctgac tctcggccag 
6600 

gcgccagcaa ggaggggccg gtccccgggg ctaccaggca ggcacgtgca catcgccatc 
6660 
9 6720 aCgCC aactccgcct ^gttttaca aagtcgttgc cttaatgcat gtggacagga

a 6780 Ct9ag gfccgccccat gccccctggc tgtgccaggt acggacgccc tggaccctgc 
g 6 a 40 ggt " ggCgggcgag 999cccaagg gacgggctcc agagacacgc gcagggcagg 
agg o gtCtCa Cggaggggtc tcgcactgag gcgcccagag ctggtggtcc cgctggacgc 
C 6960 CtCt9 cccgggatcc acacggccca cgtgtgcccg ccatgcccgc gccccacgcc 
a 7020 a9tCt tccatcctct ggccgtgacg gtggctgcag cttccccatt tgcgccgttg 
C ^on ggCtg tct 9 cacttt tgttcatgct ccaaagaaca tttcataatg ccttcagtac 

7080

C 7 a 40 taCaC ttctgaccat tCCgtatgtg tccttgtgcc gtagtgacca ggcctttttt 

tgg ^ gatgt 9 ttacccc 9c acacttcaat ctcaactttg tgcaccgtcc attttctagg 
7200

ga ~ agacgcc ca S99 aat Sra actctagttt tctaacagat tagctgagat attaacttac 
7260 

t S a ? acggac B ^9^<3^tgc cagagccgta agaatgcgcc agtgcgggtt tgcgggggac 
7320

ttcgggtgtg gggtcctgcg gccgcgatgg ccgtggaagg ttctggggat ccctgctgcc 
7380

acggggacga gttcggacgc caggtggacc tgtgcactca gtaaaacgca gtgattc 
7437 

<210> 38 

<211> 169 

<212> DNA 

<213> Homo sapiens 

<400> 38 

aCt 60 tCCCC ccgctgaacc a g ct gg a gct gctgaggaac ctcaagagta aatctggact 

cgctttcagg tcagctgggg agctccaggt ggggcgggtg ggcgtctcag tcctgggggc 
120 

cccagctgcc cacagaagac acgccaggac agtgccccag ggactccag 
169 

<210> 39

<211> 203 

<212> DNA 

<213> Homo sapiens 

<400> 39 

atgccgggac aggtgacccc atggcggaag acaggggcta tgggaatgac ttccccatcg 
60 

aagacatgat ccccaccctg aaggccgcca tccgagccgt caggtaatgc ccccacggtc 
120 

ccacctgtgc ctgtgtgcct cccccactcc agctcaactc ccacaggaag gggcttataa 
180 

aattatcttg cactttggga agg 
203 



<210> 40 

<211> 232 

<212> DNA 

<213> Homo sapiens 

<400> 40

aattctacaa ttccgtctct ataaaaaaaa attcaaggag actttgaggc cttacgatgt 
60 

gaaggatgtg attgagcagt attctgccgg gcatctcgac atgctttcca ggataaagta 
120 

ccttcagacg aggtgagaca gtcacatctg gagggactgc actcccctca aagccctatg 
180 

aaccttagag tttaaggtga gaggtattca gaaataattc aaaatgcagg ga 
232 



<210> 41

<211> 1925

<212> DNA

<213> Homo sapiens

<400> 41

ggtccattcg
60





aaaagacaat tcttttaatc agagttagta atgtggacag tacaaaatcg agagagtctg
120 

gggcttctct ctttccctgt gatgattacc atggtctgtt gtgcacacag caccaatgaa 
180 

cccagcaaca tgccatacgt gaaagagaca gtggacagat tgctcaaagg atatgacatt 
240

Cg 300 9CggC cggacttcgg a 9ggcccccc gtcgacgttg ggatgcggat cgatgtcgcc 

agcatagaca tggtctccga agtgaatatg gattatacac tcaccatgta tttccagcag 
360 

tcttggaaag acaaaaggct ttcttattct ggaatcccac tgaacctcac cctagacaat
420 

agggtagctg accaactctg ggtaccagac acctactttc tgaatgacaa gaaatcattt
480 

gtgcatgggg tcacagtgaa aaatcgaatg attcgactgc atcctgatgg aacagttctc 
540 

tatggactcc gaatcacaac cacagctgca tgtatgatgg atcttcgaag atatccattg 
600 

gatgagcaga actgcaccct ggagatcgaa agttatggct ataccactga tgacattaaa 
660 

ttttactgga atggaggaga aggggcagtc actggtgtta ataaaatcga acttcctcaa 
720 

ttttcaattg ttgactacaa gatggtgtct aagaaggtgg agttcacaac aggagcgtat 
780 

ccacgactgt cactaagttt tcgtctaaag agaaacattg gttacttcat tttgcaaacc
840 

tacatgcctt ctacactgat tacaattctg tcctgggtgt ctttttggat caactatgat 
900 

gcatctgcag ccagagtcgc actaggaatc acgacagtgc ttacaatgac aaccatcagc 
960 

acccacctca gggagaccct gccaaagatc ccttatgtca aagcgattga tatttatctg 
1020 

atgggttgct ttgtgtttgt gttcctggct ctgctggagt atgcctttgt aaattacatc 
1080 

ttctttggga aaggccctca gaaaaaggga gctagcaaac aagaccagag tgccaatgag 
1140 

aagaataaac tggagatgaa taaagtccag gtcgacgccc acggtaacat tctcctcagc 
1200

accctggaaa tccggaatga gacgagtggc tcggaagtgc tcacgagcgt gagcgacccc 
1260 

aaggccacca tgtactccta tgacagcgcc agcatccagt accgcaagcc cctgagcagc 
1320 
C ???n 9gCCt acgggcgcgc cctggaccgg cacggggtac ccagcaaggg gcgcatccgc

a ??/n tgCCt cccagctcaa agtcaagatc cccgacttga ctgatgtgaa ttccatagac 

1440

aagtggtccc gaatgttttt ccccatcacc ttttctcttt ttaatgtcgt ctattggctt 
1500

tactatgtac actgaggtct gttctaatgg ttccatttag actactttcc tcttctattg
1560

t ^o« taacc tta caggtcc ccaacagcga tactgctgtt tctcgaggta agagattcag 
1620 

C ??o^ Caatt ^gttttaggt cttgcatatc agttttatta ctgcaccatg tttacttcaa 
1680 

aaagacaaaa caaaaaaaaa attatttttc cagtctaccg tggtccaggt tatcagctct 
174 0 

ttaagagctc tattaattgc catgtttaca aacaaacaca aagagagaag ttagacaggt 
1800

agatctttag cagtcttttc tagtttccct ggatttcact gatttatttt ttagggaaaa 
1860

tgaaaagagg accttgctgt ccgcctgcac tgcttcctgg taaactataa caaacttatg
1920

ctgcc
1925 

<210> 42 

<211> 1925 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ggtccattcg ggaattactg cccagcagcc gactaagttg cattccttga atcttcgcag 
60 

aaaagacaat tcttttaatc agagttagta atgtggacag tacaaaatcg agagagtctg
120 

gggcttctct ctttccctgt gatgattacc atggtctgtt gtgcacacag caccaatgaa 

cccagcaaca tgccatacgt gaaagagaca gtggacagat tgctcaaagg atatgacatt 
240 

C9 300 g ° 9gC cggacttcgg a ^ggcccccc gtcgacgttg ggatgcggat cgatgtcgcc 
agcatagaca tggtctccga agtgaatatg gattatacac tcaccatgta tttccagcag

tC !i9n gaaag acaaaa ^ ct ttcttattct ggaatcccac tgaacctcac cctagacaat 

ag ??A a9Ct9 accaactct S ggtaccagac acctactttc tgaatgacaa gaaatcattt 
480 

gtgcatgggg tcacagtgaa aaatcgaatg attcgactgc atcctgatgg aacagttctc 
540 

tatggactcc gaatcacaac cacagctgca tgtatgatgg atcttcgaag atatccactg 
600 

gatgagcaga actgcaccct ggagatcgaa agttatggct ataccactga tgacattgaa 
660 

ttttactgga atggaggaga aggggcagtc actggtgtta ataaaatcga acttcctcaa 
720 

ttttcaattg ttgactacaa gatggtgtct aagaaggtgg agttcacaac aggagcgtat 
780 

ccacgactgt cactaagttt tcgtctaaag agaaacattg gttacttcat tttgcaaacc 
840 

tacatgcctt ctacactgat tacaattctg tcctgggtgt ctttttggat caactatgat 
900 

gcatctgcag ccagagtcgc actaggaatc acgacagtgc ttacaatgac aaccatcagc 
960 

acccacctca gggagaccct gccaaagatc ccttatgtca aagcgattga tatttatcta 
1020

atgggttgct ttgtgtttgt gttcctggct ctgctggagt atgcctttgt aaattacatc 
1080 

ttctttggga aaggccctca gaaaaaggga gctagcaaac aagaccagag tgccaatgag 

aagaataaac tggagatgaa taaagtccag gtcgacgccc acggtaacat tctcctcagc 
1200

accctggaaa tccggaatga gacgagtggc tcggaagtgc tcacgagcgt gagcgacccc 

1260

aaggccacca tgtactccta tgacagcgcc agcatccagt accgcaagcc cctgagcagc 

1320

cgcgaggcct acgggcgcgc cctggaccgg cacggggtac ccagcaaggg gcgcatccgc 
1380
aggcgtgcct cccagctcaa agtcaagatc cccgacttaa ctgatgtgaa ttccatagac
1440

a f?n 9gtCCC gaatgttttt ccccatcacc ttttctcttt ttaatgtcgt ctattggctt 

1500

tactatgtac actgaggtct gttctaatgg ttccatttag actactttcc tcttctattg 
1560

t ^" taacC ttaca ^gtcc ccaacagcga tactgctgtt tctcgaggta agagattcag 
1620

ccatccaatt ggttttaggt cttgcatatc agttttatta ctgcaccatg tttacttcaa 
1680 

a i740° aaaa caaaaaaaaa attatttttc cagtctaccg tggtccaggt tatcagctct 

ttaagagctc tattaattgc catgtttaca aacaaacaca aagagagaag ttagacaggt 
1800 

a ?n^ fct ^ a9 cag tcttttc tagtttccct ggatttcact gatttatttt ttagggaaaa 
1860 

tgaaaagagg accttgctgt ccgcctgcac tgcttcctgg taaactataa caaacttatg 
1920

ctgcc 
1925 

<210> 43

<211> 1925 

<212> DNA 

<213> Homo sapiens 

<400> 43 

ggtccattcg ggaattactg cccagcagcc gactaagttg cattccttga atcttcgcag 
60

aaajigacaat tcttttaatc agagttagta atgtggacag tacaaaatcg agagagtctg 

gggcttctct ctttccctgt gatgattacc atggtctgtt gtgcacacag caccaatgaa 
180 

cccagcaaca tgccatacgt gaaagagaca gtggacagat tgctcaaagg atatgacatt 
240 

cgcttgcggc cggacttcgg agggcccccc gtcgacgttg ggatgcggat cgatgtcgcc 
300

agcatagaca tggtctccga agtgaatatg gattatacac tcaccatgta tttccagcag 
360
tcttggaaag acaaaaggct ttcttattct ggaatcccac tgaacctcac cctagacaat
420 

agggtagctg accaactctg ggtaccagac acctactttc tgaatgacaa gaaatcattt 
480

gtgcatgggg tcacagtgaa aaatcgaatg attcgactgc atcctgatgg aacagttctc 
540 

tatggactcc gaatcacaac cacagctgca tgtatgatgg atcttcgaag atatccactg 

gatgagcaga actgcaccct ggagatcgaa agttatggct ataccactga tgacattgaa 
660 

ttttactgga atggaggaga aggggcagtc actggtgtta ataaaatcga acttcctcaa 
720 

ttttcaattg ttgactacaa gatggtgtct aagaaggtgg agttcacaac aggagcgtat 

780

ccacgactgt cactaagttt tcgtctaaag agaaacattg gttacttcat tttgcaaacc 
840 

tacatgcctt ctacactgat tacaattctg tcctgggtgt ctttttggat caactatgat 
900 

gcatctgcag ccagagtcgc actaggaatc acgacagtgc ttacaatgac aaccatcagc 
960

acccacctca gggagaccct gccaaagatc ccttatgtca aagcgattga tatttatctg 
1020

atgggttgct ttgtgtttgt gttcctggct ctgctggagt atgcttttgt aaattacatc 
1080 

ttctttggga aaggccctca gaaaaaggga gctagcaaac aagaccagag tgccaatgag 

aagaataaac tggagatgaa taaagtccag gtcgacgccc acggtaacat tctcctcaac 
1200 

accctggaaa tccggaatga gacgagtggc tcggaagtgc tcacgagcgt gagcgacccc 
1260 

aaggccacca tgtactccta tgacagcgcc agcatccagt accgcaagcc cctgagcagc 
1320

cgcgaggcct acgggcgcgc cctggaccgg cacggggtac ccagcaaggg gcgcatccgc 
1380 

aggcgtgcct cccagctcaa agtcaagatc cccgacttga ctgatgtgaa ttccatagac 
1440 

aagtggtccc gaatgttttt ccccatcacc ttttctcttt ttaatgtcgt ctattggctt 
1500

t 1560 t9taC actgaggtct gttctaatgg ttccatttag actactttcc tcttctattg 
t 1620 taaCC ttacaggtcc ccaacagcga tactgctgtt tctcgaggta agagattcag 
C 1680 Caatt ggttttaggt cttgcatatc agttttatta ctgcaccatg tttacttcaa 
a i740° aaaa caaaaaaaaa attatttttc cagtctaccg tggtccaggt tatcagctct 
t ^ aa ^ a9CtC tattaatt 9 c catgtttaca aacaaacaca aagagagaag ttagacaggt 

1800

a i860 ttta9 cagtcttttc tagtttccct ggatttcact gatttatttt ttagggaaaa 
tgaaaagagg accttgctgt ccgcctgcac tgcttcctgg taaactataa caaacttatg 

ctgcc 
1925 

<210> 44 

<211> 1536 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tgaattcgtg agatggcgag ctccacggca ccatggcccc gaagctgctg ctcctcctct 
60

9C i20 ttCtC gggcttgcac ffcgcggtcca gaaaggtgga agaggatgaa tatgaagatt 
Ca ^on tcaaa ccaaaa 9tgg gtcttggctc caaaatccca agacaccgac gtgactctta 
ttctcaacaa gttgctaaga gagtatgata aaaagctgag gccagatatt ggaataaaac 
cgaccgtaat tgacgttgac atttatgtta acagcattgg tcctgtgtca tcaataaaca 

300

tggaatacca aattgacata ttttttgctc agacctggac agatagtcgc cttcgattca 
360 

acagcacaat gaaaattctt actctgaaca gcaacatggt ggggttaatc tggatcccag 

420


acaccatctt ccgcaattct aaaaccgcag aggctcactg gatcadcaca cccaatcagc 
480

tcctccggat ttggaatgac gggaaaatcc tttacacttt gaggctcacc atcaatgctg 
ag 600 Ca9Ct gcagctgcac aacttcccca tggacgaaca ctcctgcccg ctgattttct 
CC 660 tatg£r Ctatcccaaa Saagaaatga tttatagatg gagaaaaaat tcagtggagg 
cagctgacca gaaatcatgg cggctttatc agtttgactt catgggcctc agaaacacca 
Ca9 | aatCgt Sracaacgtct gcaggtgatt atgttgtcat gactatatat tttgaattga 

gtagaagaat gggatacttc accattcaga catacattcc ctgtatactg actgtggttt
840 

ta ann tg99t STtcattttgg atcaaaaaag atgctacgcc agcaagaaca gcattaggca 
900 


tC 960 aC99t gctgaccatg accaccctga gcaccatcgc caggaagtcc ttgccacgcg 
t ?non taCgt 9 acc 9 ccat 9 gacctttttg tgactgtgtg cttcctgttt gtcttcgccg 
C ?SoH aC9ga 9 tat 9 cca <=c ctcaactact attccagctg tagaaaacca accaccacga 
aaaa S acaac atc 9 ttacta catccagatt cctcaagatg gattcctgag cgaataagcc 

1140


ta ? aagCCCC ttccaactat tccctcctgg acatgaggcc accaccacct gcgatgatca 

1200

c t"aaacaa ttcc 9tttac tggcaggaat ttgaagatac ctgtgtctat gagtgtctgg 
1260

a ^f?n aaaga ct ^ tca ^ a ^ c ttcttctgct gctatgaaga atgtaaatca ggatcctgga 

1320

ggaaagggcg tattcacata gacatcttgg agctggactc gtactcccgg gtctttttcc 
1380

ccacgtcctt cctgctcttt aacctggtct actgggttgg atacctgtat ctctaagtgt 

t ??nS aga9t g aa 9T a 9' t g aa gagcatttgg tacacacttg accttctgtc gtccccagac 
1500 

cagtagtgac caatcgggag tagcaaggaa ggacac 
1536
<210> 45

<211> 1843 

<212> DNA 

<213> Homo sapiens 

<400> 45 

gcacaattca gaggtaacag cgcctgcgtt ttctccatga taacatagac aaacagttgc 
60 

ct ^n aagCt gcagattgga tattgggaag caaatttggg tgtgaaatct tcagcaaagg 
agcacgcaga gtccatgatg gctcagacca agtgagtgag aggcagagcg agggcgcccc 

tC 9?n tCtg9 cs,cgcccgga ctcggactcg cagactcgcg ctggctccag tctctccacg 
240

at !^tctctc ccagactttt ccccggtctt aagagatcct gtgtccagag ggggccttag 
300

ctgctccagc ccgcgatgag gaaaagtcca ggtctgtctg actgtctttg ggcctggatc 
360 


Ct 4on tCtga gcacactgac tggaagaagc tatggacagc cgtcattaca agatgaactt 

420

aaagacaata ccactgtctt caccaggatt ttggacagac tcctagatgg ttatgacaat 
480 

cgcctgagac caggattggg agagcgtgta accgaagtga agactgatat cttcgtcacc 
540 

agtttcggac ccgtttcaga ccatgatatg gaatatacaa tagatgtatt tttccgtcaa 
600 

agctggaagg atgaaaggtt aaaatttaaa ggacctatga cagtcctccg gttaaataac 
660 

Ct ?on" Caa gtaaaatccg gactccggac acatttttcc acaatggaaa gaagtcagtg 

gcccacaaca tgaccatgcc caacaaactc ctgcggatca cagaggatgg caccttgctg 
780 

tacaccatga ggctgacagt gagagctgaa tgtccgatgc atttggagga cttccctatg 
840

gatgcccatg cttgcccact aaaatttgga agttatgctt atacaagagc agaagttgtt 
900 

tatgaatgga ccagagagcc agcacgctca gtggttgtag cagaagatgg atcacgtcta 
960 
a i020 9tat9 acCttcttgg acaaacagta gactctggaa ttgtccagtc aagtacagga

9a 080 t9tt9 ttatgaccac tcatttccac ttgaagagaa agattggcta ctttgttatt 
caaacatacc tgccatgcat aatgacagtg attctctcac aagtctcctt ctggctcaac 
a ?200 9tCt9 taccagcaag aactgtcttt ggagtaacaa ctgtgctcac catgacaaca 
t 1260 CatCa gtgccagaaa ^tccctccct aaggtggctt atgcaacagc tatggattgg 
t 1320 t9CC9 t9tgctatgc ctttgtgttc tcagctctga ttgagtttgc cacagtaaac 
ta 380 CaCta agagaggtta tgcatgggat ggcaaaagtg tggttccaga aaagccaaag 
a i44n aaa " atcctcttat taagaaaaac aacacttacg ctccaacagc aaccagctac 
a a500 taatt tggccagggg cgacccgggc ttagccacca ttgctaaaag tgcaaccata 

9*aacctaaag aggtcaagcc cgaaacaaaa ccaccagaac ccaagaaaac ctttaacagt 
1560

g !rf agcaaaa ttaac cgact gtcaagaata gccttcccgc tgctatttgg aatctttaac 
t 1 t f?n Ctact SWCtacgta tttaaacaga gagcctcagc taaaagcccc cacaccacat 

1680 

C ^ a9atCt tttactcaca ttctgttgtt cagttcctct gcactgggaa tttatttatg 
t ^nn aaCgC ag taattccc atctgccttt attgcctctg tcttaaagaa tttgaaagtt 

1800 

tccttatttt cataattcat ttaagacaag agacccctgt cta
1843

<210> 46 

<211> 1843 

<212> DNA 

<213> Homo sapiens 

<400> 46 

gcacaattca gaggtaacag cgcctgcgtt ttctccatga taacatagac aaacagttgc 
60 
ctccaaagct gcagattgga tattgggaag caaatttggg tgtgaaatct tcagcaaagg

agcacgcaga gtccatgatg gctcagacca agtgagtgag aggcagagcg aggacgcccc 

tC ~?^ tCtgg c 9 c 9cccgga ctcggactcg cagactcgcg ctggctccag tctctccacg 

240

attctctctc ccagactttt ccccggtctt aagagatcct gtgttcagag ggggccttag 

ctgctccagc ccgcgatgag gaaaagtcca ggtctgtctg actgtctttg ggcctggatc 
360

Ct /9n tCtga gcacact ^ ac tggaagaagc tatggacagc cgtcattaca agatgaactt 

aaagacaata ccactgtctt caccaggatt ttggacagac tcctagatgg ttatgacaat 
480 

cgcctgagac caggattggg agagcgtgta accgaagtga agactgatat cttcgtcacc 
540

agtttcggac ccgtttcaga ccatgatatg gaatatacaa tagatgtatt tttccgtcaa 
600 


agctggaagg atgaaaggtt aaaatttaaa ggacctatga cagtcctccg gttaaataac 
660 

ctaatggcaa gtaaaatccg gactccggac acatttttcc acaatggaaa gaagtcagtg 
720

gcccacaaca tgaccatgcc caacaaactc ctgcggatca cagaggatgg caccttgctg 
780 

tacaccatga ggctgacagt gagagctgaa tgtccgatgc atttggagga cttccctatg 
840

gatgcccatg cttgcccact aaaatttgga agttatgctt atacaagagc agaagttgtt 
900 

tatgaatgga ccagagagcc agcacgctca gtggttgtag cagaagatgg atcacgtcta 
960 

aaccagtatg accttcttgg acaaacagta gactctggaa ttgtccagtc aagtacagga 
1020

gaatatgttg ttatgaccac tcatttccac ttgaagagaa agattggcta ctttgttatt 
1080 

caaacatacc tgccatgcat aatgacagtg attctctcac aagtctcctt ctggctcaac 
1140 
agagagtctg taccagcaag aactgtcttt ggagtaacaa ctgtgctcac catgacaaca
1200 

ttgagcatca gtgccagaaa ctccctccct aaggtggctt atgcaacagc tatggattgg 
1260 

tttattgccg tgtgctatgc ctttgtgttc tcagctctga ttgagtttgc cacagtaaac 
1320 

fc ^tttcacta agagaggtta tgcatgggat ggcaaaagtg tggttccaga aaagccaaag 

aaagtaaagg atcctcttat taagaaaaac aacacttacg ctccaacagc aaccagctac 
1440 

acccctaatt tggccagggg cgacccgggc ttagccacca ttgctaaaag tgcaaccata 
1500 

gaacctaaag aggtcaagcc cgaaacaaaa ccaccagaac ccaagaaaac ctttaacagt
1560 

gtcagcaaaa ttgaccgact gtcaagaata gccttcccgc tgctatttgg aatctttaac 
1620 

ttagtctact gggctacgta tttaaacaga gagcctcagc taaaagcccc cacaccacat 
1680 

caatagatct tttactcaca ttctgttgtt cagttcctct gcactgggaa tttatttatg 
1740 

ttctcaacgc agtaattccc atctgccttt attgcctctg tcttaaagaa tttgaaagtt 
1800 

tccttatttt cataattcat ttaagacaag agacccctgt ctg 
1843 



<210> 47 

<211> 2189 

<212> DNA 

<213> Homo sapiens 

<400> 47 

cctagcgctc ctctccggct tccaccagcc catcgctcca cgctctcttg gctgctgcag 
60 

tctcggtctc tctctctctc tctctctctc tctctctctc tctctctctc tctctctctc 
120 

tctctctctc tctctcccaa gtttcctatc tcgtcaagat cagggcaaaa gaagaaaaca 
180 

ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa catctacaac 
240 


atcgagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt gctggctaac 
300 

atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct tgacagactt 
360

ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac tgaagtcttc 
420 

actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga atatacaatt 
480 

gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg tcctatgaat 
540 

atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac cttttttcac 
600 

aatgggaaga aatcagtagc tcataatatg acaatgccaa ataagttgct tcgaattcag 

9 a ^gatggga ctctgctgta taccatgagg cttacagttc aagctgaatg cccaatgcac 

ttggaggatt tcccaatgga tgctcattca tgtcctctga aatttggcag ctatgcatat 
780 

a °840 t:tlCag aggtcactta tat ttggact tacaatgcat ctgattcagt acaggttgct 

cctgatggct ctaggttaaa tcaatatgac ctgctgggcc aatcaatcgg aaaggagaca 
900 

attaaatcca gtacaggtga atatactgta atgacagctc atttccacct gaaaagaaaa 
960 

attgggtatt ttgtgattca aacctatctg ccttgcatca tgactgtcat tctctcccaa 
1020 

gtttcattct ggcttaacag agaatctgtg cctgcaagaa ctgtgtttgg agtaacaact
1080 


gtcctaacaa tgacaactct aagcatcagt gctcggaatt ctctccccaa agtggcttat 
1140 

gcaactgcca tggactggtt tattgctgtt tgttatgcat ttgtgttctc tgccctaatt 
1200 

gaatttgcaa ctgttaatta cttcaccaaa agaggatgga cttgggatgg gaagagtgta 
1260

gtaaatgaca agaaaaaaga aaaggcttcc gttatgatac agaacaacgc ttatgcagtg 
1320 

gctgttgcca attatgcccc gaatctttca aaagatccag ttctctccac catctccaag
1380

agtgcaacca cgccagaacc caacaagaag ccagaaaaca agccagctga agcaaagaaa 
1440

actttcaaca gtgttagcaa aattgacaga atgtccagaa tagtttttcc agttttgttt
1500 

ggtaccttta atttagttta ctgggctaca tatttaaaca gagaacctgt attaggggtc 
1560

agtccttgaa ttgagaccca tgttatcttt gggatgtata gcaacattaa atttggtttg 
1620 

ttttgctatg tacagtctga ctaataactg ctaatttgtg atccaacatg tacagtatgt 
1680 

atatagtgac atagcttacc agtagacctt taatggagac atgcatttgc taactcatgg
1740 

aactgcagac agaaagcact ccatgcgaaa acagccattg ccttttttaa agatttaccc 
1800 

taggacctga tttaaagtga atttcaaatg acctgattaa tttcctattc ttccaaatga 
1860

gatgaaaatg gggatcctgt acaacccttt gtggaccctt ttggtttagc tcttaagtag 
1920 

gggtattttc tactgttgct taattatgat ggaagataac attgtcattc ctagatgaat 
1980 

cctttgaagt aacaaacatt gtatctgaca tcagctctgt tcatgagtgc tcagagtccc 
2040

tgctaatgta attggaagct tggtacacat aagaaaaact agagatttga aatctagcta 
2100 

tgaattactc tatatagtat ctatagccat gtacatatta cagcatgaca agctcgaaat 
2160 

aattatgagt cagcccgaaa gatgttaat 
2189 



<210> 48 

<211> 2352 

<212> DNA 

<213> Homo sapiens 

<400> 48 

gaagatgctg ttgagggccc tggagaaact tcagcagaac agggcctctc cccttgcagg
60 

ccgagccgcg gccctgcgcc ctccccctcc gcccagctcg gccaagggcg catttgctga
120

gcgtctggcg gcctctaccg gagcacctct gcagagggcc gatcctccag cccagagacg 
180

acatgtggcg ctcgggcgag tgccttgcag agagaggagt agcttgctgg ctttgaacgc
240

gtggcgtggc agatatttca gaaagcttca agaacaagct ggagaaggga agagttattc
300

ctccatattc acctgcttca actactattc ttattgggaa tggacaatgg aatgttctct
360

ggttttatca tgatcaaaaa cctccttctc ttttgtattt ccatgaactt atccagtcac 
420 

tttggctttt cacagatgcc aaccagttca gtgaaagatg agaccaatga caacatcacg
480

atatttacca ggatcttgga tgggctcttg gatggctacg acaacagact tcggcccggg 
540 

ctgggagagc gcatcactca ggtgaggacc gacatctacg tcaccagctt cggcccggtg 
600



tccgacacgg aaatggagta caccatagac gtgtttttcc gacaaagctg gaaagatgaa
660

aggcttcggt ttaaggggcc catgcagcgc ctccctctca acaacctcct tgccagcaag
720

atctggaccc cagacacgtt cttccacaac gggaagaagt ccatcgctca caacatgacc 
780 

acgcccaaca agctgctgcg gctggaggac gacggcaccc tgctctacac catgcgcttg 
840

accatctctg cagagtgccc catgcagctt gaggacttcc cgatggatgc gcacgcttgc 
900 

cctctgaaat ttggcagcta tgcgtaccct aattctgaag tcgtttacgt ctggaccaac 
960 


ggctccacca agtcggtggt ggtggcggaa gatggctcca gactgaacca gtaccacctg 
1020 

atggggcaga cggtgggcac tgagaacatc agcaccagca caggcgaata cacaatcatg 
1080 

acagctcact tccacctgaa aaggaagatt ggctactttg tcatccagac ctaccttccc 
1140 

tgcataatga ccgtgatctt atcacaggtg tccttttggc tgaaccggga atcagtccca
1200

gccaggacag tttttggggt caccacggtg ctgaccatga cgaccctcag catcagcgcc
1260

aggaactctc tgcccaaagt ggcctacgcc accgccatgg actggttcat agctgtgtgc
1320

tatgccttcg tcttctcggc gctgatagag tttgccacgg tcaattactt taccaagaga
1380

ggctgggcct gggatggcaa aaaagccttg gaagcagcca agatcaagaa aaagcgtgaa
1440

gtcatactaa ataagtcaac aaacgctttt acaactggga agatgtctca ccccccaaac
1500

attccgaagg aacagacccc agcagggacg tcgaatacaa cctcagtctc agtaaaaccc 
1560 

tctgaagaga agacttctga aagcaaaaag acttacaaca gtatcagcaa aattgacaaa
1620

atgtcccgaa tcgtattccc agtcttgttc ggcactttca acttagttta ctgggcaacg
1680

tatttgaata gggagccggt gataaaagga gccgcctctc caaaataacc ggccacactc 
1740 

ccaaactcca agacagccat acttccagcg aaatggtacc aaggagaggt tttgctcaca
1800

gggactctcc atatgtgagc actatctttc aggaaatttt tgcatgttta ataatatgta
1860

caaataatat tgccttgatg tttctatatg taacttcaga tgtttccaag atgtcccatt
1920

gataattcga gcaaacaact ttctggaaaa acaggatacg atgactgaca ctcagatgcc
1980

cagtatcata cgttgatagt ttacaaacaa gatacgtata tttttaactg cttcaagtgt
2040

tacctaacaa tgttttttat acttcaaatg tcatttcata caaattttcc cagtgaataa 
2100 

atattttagg aaactctcca tgattattag aagaccaact atattgcgag aaacagagat 
2160 

cataaagagc acgttttcca ttatgaggaa acttggacat ttatgtacaa aatgaattgc 
2220 

ctttgataat tcttactgtt ctgaaattag gaaagtactt gcatgatctt acacgaagaa
2280

atagaatagg caaactttta tgtaggcaga ttaataacag aaatacatca tatgttagat
2340

acacaaaata tt 
2352 



<210> 49 

<211> 2352 

<212> DNA 

<213> Homo sapiens 



<400> 49 

gaagatgctg ttgagggccc tggagaaact tcagcagaac agggcctctc cccttgcagg
60

ccgagccgcg gccctgcgcc ctccccctcc gcccagctcg gccaagggcg cgtttgctga
120

gcgtctggcg gcctctaccg gagcacctct gcagagggcc gatcctccag cccagagacg
180

acatgtggcg ctcgggcgag tgccttgcag agagaggagt agcttgctgg ctttgaacgc
240

gtggcgtggc agatatttca gaaagcttca agaacaagct ggagaaggga agagttattc
300

ctccatattc acctgcttca actactattc ttattgggaa tggacaatgg aatgttctct
360

ggttttatca tgatcaaaaa cctccttctc ttttgtattt ccatgaactt atccagtcac
420

tttggctttt cacagatgcc aaccagttca gtgaaagatg agaccaatga caacatcacg
480

atatttacca ggatcttgga tgggctcttg gatggctacg acaacagact tcggcccggg
540

ctgggagagc gcatcactca ggtgaggacc gacatctacg tcaccagctt cggcccggtg
600

tccgacacgg aaatggagta caccatagac gtgtttttcc gacaaagctg gaaagatgaa 
660 

aggcttcggt ttaaggggcc catgcagcgc ctccctctca acaacctcct tgccagcaag
720

atctggaccc cagacacgtt cttccacaac gggaagaagt ccatcgctca caacatgacc 
780 

acgcccaaca agctgctgcg gctggaggac gacggcaccc tgctctacac catgcgcttg
840

accatctctg cagagtgccc catgcagctt gaggacttcc cgatggatgc gcacgcttgc
900

cctctgaaat ttggcagcta tgcgtaccct aattctgaag tcgtttacgt ctggaccaac
960

ggctccacca agtcggtggt ggtggcggaa gatggctcca gactgaacca gtaccacctg
1020

atggggcaga cggtgggcac tgagaacatc agcaccagca caggcgaata cacaatcatg
1080

acagctcact tccacctgaa aaggaagatt ggctactttg tcatccagac ctaccttccc
1140

tgcataatga ccgtgatctt atcacaggtg tccttttggc tgaaccggga atcagtccca
1200

gccaggacag tttttggggt caccacggtg ctgaccatga cgaccctcag catcagcgcc 
1260

aggaactctc tgcccaaagt ggcctacgcc accgccatgg actggttcat agctgtgtgc
1320

tatgccttcg tcttctcggc gctgatagag tttgccacgg tcaattactt taccaagaga 
1380 

ggctgggcct gggatggcaa aaaagccttg gaagcagcca agatcaagaa aaagcgtgaa
1440

gtcatactaa ataagtcaac aaacgctttt acaactggga agatgtctca ccccccaaac 
1500 

attccgaagg aacagacccc agcagggacg tcgaatacaa cctcagtctc agtaaaaccc 
1560 

tctgaagaga agacttctga aagcaaaaag acttacaaca gtatcagcaa aattgacaaa 
1620 

atgtcccgaa tcgtattccc agtcttgttc ggcactttca acttagttta ctgggcaacg
1680

tatttgaata gggagccggt gataaaagga gccgcctctc caaaataacc ggccacactc 
1740 

ccaaactcca agacagccat acttccagcg aaatggtacc aaggagaggt tttgctcaca 
1800

gggactctcc atatgtgagc actatctttc aggaaatttt tgcatgttta ataatatgta 
1860 

caaataatat tgccttgatg tttctatatg taacttcaga tgtttccaag atgtcccatt 
1920 

gataattcga gcaaacaact ttctggaaaa acaggatacg atgactgaca ctcagatgcc
1980

cagtatcata cgttgatagt ttacaaacaa gatacgtata tttttaactg cttcaaatat 
2040 

tacctaacaa tgttttttat acttcaaatg tcatttcata caaattttcc cagtgaataa 
2100 

atattttagg aaactctcca tgattattag aagaccaact atattgcgag aaacagagat
2160 

cataaagagc acgttttcca ttatgaggaa acttggacat ttatgtacaa aatgaattgc 
2220 

ctttgataat tcttactgtt ctgaaattag gaaagtactt gcatgatctt acacgaagaa 
2280 

atagaatagg caaactttta tgtaggcaga ttaataacag aaatacatca tatgttagat 
2340 

acacaaaata tt 
2352 



<210> 50 

<211> 2373 

<212> DNA 

<213> Homo sapiens 

<400> 50 

gcgcgcggcc cggggcgcgg cgcggagcgg agctgcaggg cggcggcggg agcgcggggc 
60 

gcaagagccg ctccgccggg agtgccgggg aagttcgcgc tggcagcatg gggcggtgac 
120 

gccgcaccgg ccttccgcgc ctgccagccg ggcgagagca ggcggaggag aaggaggatg 
180 

catcctcacc gacggctcgc ctcccggggc ccgcgcgcag gtgccttgca gagagaggag 
240 

tagcttgctg gctttgaacg cgtggcgtgg cagatatttc agaaagcttc aagaacaagc 
300 

tggagaaggg aagagttatt cctccatatt cacctgcttc aactactatt cttattggga 
360 

atggacaatg gaatgttctc tggttttatc atgatcaaaa acctccttct cttttgtatt
420 

tccatgaact tatccagtca ctttggcttt tcacagatgc caaccagttc agtgaaagat 
480 

gagaccaatg acaacatcac gatatttacc aggatcttgg atgggctctt ggatggctac
540

gacaacagac ttcggcccgg gctgggagag cgcatcactc aggtgaggac cgacatctac 
600 

gtcaccagct tcggcccggt gtccgacacg gaaatggagt acaccataga cgtgtttttc 
660 

cgacaaagct ggaaagatga aaggcttcgg tttaaggggc ccatgcagcg cctccctctc 
720 

aacaacctcc ttgccagcaa gatctggacc ccagacacgt tcttccacaa cgggaagaaa 
780 

tccatcgctc acaacatgac cacgcccaac aagctgctgc ggctggagga cgacggcacc 
840 

ctgctctaca ccatgcgctt gaccatctct gcagagtgcc ccatgcagct tgaggacttc 
900 

ccgatggatg cgcacgcttg ccctctgaaa tttggcagct atgcgtaccc taattctgaa 
960 

gtcgtttacg tctggaccaa cggctccacc aagtcggtgg tggtggcgga agatggctcc
1020 

agactgaacc agtaccacct gatggggcag acggtgggca ctgagaacat cagcaccagc 
1080 

acaggcgaat acacaatcat gacagctcac ttccacctga aaaggaagat tggctacttt 
1140 

gtcatccaga cctaccttcc ctgcataatg accgtgatct tatcacaggt gtccttttgg
1200

ctgaaccggg aatcagtccc agccaggaca gtttttgggg tcaccacggt gctgaccatg 
1260 

acgaccctca gcatcagcgc caggaactct ctgcccaaag tggcctacgc caccgccatg 
1320 

gactggttca tagctgtgtg ctatgccttc gtcttctcgg cgctgataga gtttgccacg
1380

gtcaattact ttaccaagag aggctgggcc tgggatggca aaaaagcctt ggaagcagcc 
1440 

aagatcaaga aaaagcgtga agtcatacta aataagtcaa caaacgcttt tacaactggg
1500 

aagatgtctc accccccaaa cattccgaag gaacagaccc cagcagggac gtcgaataca 
1560 

acctcagtct cagtaaaacc ctctgaagag aagacttctg aaagcaaaaa gacttacaac 
1620

agtatcagca aaattgacaa aatgtcccga atcgtattcc cagtcttgtt cggcactttc
1680 



aacttagttt actgggcaac gtatttgaat agggagccgg tgataaaagg agccgcctct
1740



ccaaaataac cggccacact cccaaactcc aagacagcca tacttccagc gaaatggtac 
1800 



caaggagagg ttttgctcac agggactctc catatgtgag cactatcttt caggaaattt 
1860 

ttgcatgttt aataatatgt acaaataata ttgccttgat gtttctatat gtaacttcag
1920 



atgtttccaa gatgtcccat tgataattcg agcaaacaac tttctggaaa aacaggatac 
1980 



gatgactgac actcagatgc ccagtatcat acgttgatag tttacaaaca agatacgtat 
2040 

atttttaact gcttcaagtg ttacctaaca atgtttttta tacttcaaat gtcatttcat 
2100 



acaaattttc ccagtgaata aatattttag gaaactctcc atgattatta gaagaccaac 
2160 



tatattgcga gaaacagaga tcataaagag cacgttttcc attatgagga aacttggaca 
2220 



tttatgtaca aaatgaattg cctttgataa ttcttactgt tctgaaatta ggaaagtact 
2280 



tgcatgatct tacacgaaga aatagaatag gcaaactttt atgtaggcag attaataaca 
2340 

gaaatacatc atatgttaga tacacaaaat att 
2373 



<210> 51 

<211> 1974 

<212> DNA 

<213> Homo sapiens 

<400> 51 

cgcgcgggga agggaagaag aggacgaggt ggcgcagaga ccgcgggaga acacagtgct 
60 



tccggaggaa atctgctcgg tccccggcag ccgcgcttcc cctttgatgt tttggtacgc 
120 



cgtggccatg cgcctcacat tagaattact gcactgggca gactaagttg gatctcctct 
180

cttcagtgaa accctcaatt ccatcaaaaa ctaaagggat gtggagagtc cggaaaaggg
240 

gctactttgg gatttggtcc ttccccttaa taatcgccgc tgtctgtgcg cagagtgtca 
300

atgaccctag taatatgtcg ctggttaaag agacggtgga tagactcctg aaaggctatg
360

acattcgtct gagaccagat tttggaggtc cccccgtggc tgtggggatg aacattgaca 
420

ttgccagcat cgatatggtt tctgaagtca atatggatta taccttgaca atgtactttc
480

aacaagcctg gagagataag aggctgtcct ataatgtaat acctttaaac ttgactctga 
540

acaacagagt ggcagaccag ctctgggtgc ctgataccta tttcctgaac gataagaagt 
600 

catttgtgca cggagtgact gttaagaacc gcatgattcg cctgcatcct gatggcaccg
660

tcctttatgg actcagaatc acaaccacag ctgcctgcat gatggaccta aggaggtacc 
720 

cactggatga acaaaactgc accttggaaa ttgagagcta tggatacaca actgatgaca
780

ttgagtttta ctggcgtggc gatgataatg cagtaacagg agtaacgaaa attgaacttc 
840

cacagttctc tattgtagat tacaaactta tcaccaagaa ggttgttttt tccacaggtt 
900 

cctatcccag gttatccctc agctttaagc ttaagagaaa cattggctac tttatcctgc 
960

aaacatacat gccttccatc ctgattacca tcctctcctg ggtctccttc tggattaatt 
1020 

acgatgcttc agctgcaagg gtggcattag gaatcacaac tgtcctcaca atgaccacaa 
1080 

tcaacaccca cctccgggaa actctcccta aaatccccta tgtgaaggcc attgacatgt 
1140 

acctgatggg gtgctttgtc ttcgttttca tggcccttct ggaatatgcc ctagtcaact 
1200 

acatcttctt tgggaggggg ccccaacgcc aaaagaaagc agctgagaag gctgccagtg 
1260

ccaacaatga gaagatgcgc ctggatgtca acaagatttt ttataaagat attaaacaaa
1320 

atgggaccca atatcgatcc ttgtgggacc ctactggaaa cctctcccca actagacgga 
1380 



ctaccaatta cgatttctct ctgtatacga tggaccccca tgagaacatc ttactgagca 
1440 


ctctcgagat aaaaaatgaa atggccacat ctgaggctgt gatgggactt ggagacccca 
1500 

gaagcacaat gctagcctat gatgcctcca gcatccagta tcggaaagct gggttgccca 
1560 

ggcatagttt tggccgaaat gctctggaac gacatgtggc gcaaaagaaa agtcgcctga 
1620 

ggagacgcgc ctcccaactg aaaatcacca tccctgactt gactgatgtg aatgccatag 
1680 

atcggtggtc ccgcatattc ttcccagtgg ttttttcctt cttcaacatc gtctattggc 
1740 

tttactatgt gaactaaaca tggcctccca ctggaagcaa ggactagatt cctcctcaaa 
1800 

ccagttgtac agcctgatgt aggacttgga aaacacatca atccaggaca aaagtgacgc 
1860 

taaaatacct tagttgctgg cctatcctgt ggtccatttc ataccatttg ggttgcttct 
1920 

gctaagtaat gaatacacta aggtccttgt ggttttccag ttaaaacgca agta 
1974 



<210> 52 

<211> 1974 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cgcgcgggga agggaagaag aggacgaggt ggcgcagaga ccgcgggaga acacagtgcc 
60 

tccggaggaa atctgctcgg tccccggcag ccgcgcttcc cctttgatgt tttggtacgc 
120 

cgtggccatg cgcctcacat tagaattact gcactgggca gactaagttg gatctcctct 
180 

cttcagtgaa accctcaatt ccatcaaaaa ctaaagggat gtggagagtc cggaaaaggg 
240

gctactttgg gatttggtcc ttccccttaa taatcgccgc tgtctgtgcg cagagtgtca
300

atgaccctag taatatgtcg ctggttaaag agacggtgga tagactcctg aaaggctatg
360

acattcgtct gagaccagat tttggaggtc cccccgtggc tgtggggatg aacattgaca
420

ttgccagcat cgatatggtt tctgaagtca atatggatta taccttgaca atgtactttc
480

aacaagcctg gagagataag aggctgtcct ataatgtaat acctttaaac ttgactctgg
540

acaacagagt ggcagaccag ctctgggtgc ctgataccta tttcctgaac gataagaagt
600

catttgtgca cggagtgact gttaagaacc gcatgattcg cctgcatcct gatggcaccg
660

tcctttatgg actcagaatc acaaccacag ctgcctgcat gatggaccta aggaggtacc
720

cactggatga acaaaactgc accttggaaa ttgagagcta tggatacaca actgatgaca
780

ttgagtttta ctggcgtggc gatgataatg cagtaacagg agtaacgaaa attgaacttc
840

cacagttctc tattgtagat tacaaactta tcaccaagaa ggttgttttt tccacaggtt
900

cctatcccag gttatccctc agctttaagc ttaagagaaa cattggctac tttatcctgc
960

aaacatacat gccttccatc ctgattacca tcctctcctg ggtctccttc tggattaatt
1020

acgatgcttc agctgcaagg gtggcattag gaatcacaac tgtcctcaca atgaccacaa
1080

tcaacaccca cctccgggaa actctcccta aaatccccta tgtgaaggcc attgacatgt
1140

acctgatggg gtgctttgtc ttcgttttca tggcccttct ggaatatgcc ctagtcaact
1200

acatcttctt tgggaggggg ccccaacgcc aaaagaaagc agctgagaag gctgccagtg
1260

ccaacaatga gaagatgcgc ctggatgtca acaagatttt ttataaagat attaaacaaa
1320

atgggaccca atatcgatcc ttgtgggacc ctactggaaa cctctcccca actagacgga
1380



ctaccaatta cgatttctct ctgtatacga tggaccccca tgagaacatc ttactgagca 
1440



ctctcgagat aaaaaatgaa atggccacat ctgaggctgt gatgggactt ggagacccca 
1500 



gaagcacaat gctagcctat gatgcctcca gcatccagta tcggaaagct gggttgccca 
1560 



ggcatagttt tggccgaaat gctctggaac gacatgtggc gcaaaagaaa agtcgcctga 
1620



ggagacgcgc ctcccaactg aaaatcacca tccctgactt gactgatgtg aatgccatag 
1680 



atcggtggtc ccgcatattc ttcccagtgg ttttttcctt cttcaacatc gtctattggc 
1740 



tttactatgt gaactaaaca tggcctccca ctggaagcaa ggactagatt cctcctcaaa 
1800 



ccagttgtac agcctgatgt aggacttgga aaacacatca atccaggaca aaagtgacgc 
1860 



taaaatacct tagttgctgg cctatcctgt ggtccatttc ataccatttg ggttgcttct
1920 

gctaagtaat gaatacacta aggtccttgt ggttttccag ttaaaatgca agta
1974 



<210> 53 

<211> 3282 

<212> DNA 

<213> Homo sapiens 

<400> 53 

gggacagggc tgaggatgag gagaaccctg gggacccaga agaccgtgcc ttgcctggaa 
60 

gtcctgcctg taggcctgaa ggacttgccc taacagagcc tcaacaacta cctggtgatt 
120

cctacttcag ccccttggtg tgagcagctt ctcaacatga actacagcct ccacttggcc 
180 

ttcgtgtgtc tgagtctctt cactgagagg atgtgcatcc aggggagtca gttcaacgtc 
240 

gaggtcggca gaagtgacaa gctttccctg cctggctttg agaacctcac agcaggatat
300 

aacaaatttc tcaggcccaa ttttggtgga gaacccgtac agatagcgct gactctggac
360



attgcaagta tctctagcat ttcagagagt aacatggact acacagccac catatacctc 
420

cgacagcgct ggatggacca gcggctggtg tttgaaggca acaagagctt cactctggat 
480 



gcccgcctcg tggagttcct ctgggtgcca gatacttaca ttgtggagtc caagaagtcc 
540 



ttcctccatg aagtcactgt gggaaacagg ctcatccgcc tcttctccaa tggcacggtc 
600 



ctgtatgccc tcagaatcac gacaactgtt gcatgtaaca tggatctgtc taaatacccc 
660 



atggacacac agacatgcaa gttgcagctg gaaagctggg gctatgatgg aaatgatgtg 
720 



gagttcacct ggctgagagg gaacgactct gtgcgtggac tggaacacct gcggcttgct 
780 



cagtacacca tagagcggta tttcacctta gtcaccagat cgcagcagga gacaggaaat 
840 



tacactagat tggtcttaca gtttgagctt cggaggaatg ttctgtattt cattttggaa 
900 

acctacgttc cttccacttt cctggtggtg ttgtcctggg tttcattttg gatctctctc
960 



gattcagtcc ctgcaagaac ctgcattgga gtgacgaccg tgttatcaat gaccacactg 
1020 



atgatcgggt cccgcacttc tcttcccaac accaactgct tcatcaaggc catcgatgtg 
1080 



tacctgggga tctgctttag ctttgtgttt ggggccttgc tagaatatgc agttgctcac 
1140 

tacagttcct tacagcagat ggcagccaaa gataggggga caacaaagga agtagaagaa 
1200 



gtcagtatta ctaatatcat caacagctcc atctccagct ttaaacggaa gatcagcttt 
1260 



gccagcattg aaatttccag cgacaacgtt gactacagtg acttgacaat gaaaaccagc 
1320 



gacaagttca agtttgtctt ccgagaaaag atgggcagga ttgttgatta tttcacaatt 
1380 



caaaacccca gtaatgttga tcactattcc aaactactgt ttcctttgat ttttatgcta 
1440 
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g ?^n tgtat tttact 9ggc atactacatg tatttttgag tcaatgttaa atttcttgca 
1500 

tgccataggt cttcaacagg acaagataat gatgtaaatg gtattttagg ccaagtgtgc 
1560 

acccacatcc aatggtgcta caagtgactg aaataatatt tgagtctttc tgctcaaaga 
1620 

atgaagctcc aaccattgtt ctaagctgtg tagaagtcct agcattatag gatcttgtaa 
1680 

tagaaacatc agtccattcc tctttcatct taatcaagga cattcccatg gagcccaaga 
1740 

ttacaaatgt actcagggct gtttattcgg tggctccctg gtttgcattt acctcatata 
1800 

aagaatggga aggagaccat tgggtaaccc tcaagtgtca gaagttgttt ctaaagtaac 
1860 

tatacatgtt ttttactaaa tctctgcagt gcttataaaa tacattgttg cctatttagg 
1920 

gagtaacatt ttctagtttt tgtttctggt taaaatgaaa tatgggctta tgtcaattca 
1980 


ttggaagtca atgcactaac tcaataccaa gatgagtttt taaataatga atattattta 
2040 

ataccacaac agaattatcc ccaatttcca ataagtccta tcattgaaaa ttcaaatata 
2100 

agtgaagaaa aaattagtag atcaacaatc taaacaaatc cctcggttct aagatacaat 
2160 

ggattcccca tactggaagg actctgaggc tttattcccc cactatgcat atcttatcat 
2220 

tttattatta tacacacatc catcctaaac tatactaaag cccttttccc atgcatggat
2280 

ggaaatggaa gatttttttg taacttgttc tagaagtctt aatatgggct gttgccatga 
2340 

aggcttgcag aattgagtcc attttctagc tgcctttatt cacatagtga tggggtacta 
2400 

aaagtactgg gttgactcag agagtcgctg tcattctgtc attgctgcta ctctaacact 
2460 

gagcaacact ctcccagtgg cagatcccct gtatcattcc aagaggagca ttcatccctt 
2520 
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2580 taatg atcaggaatg atgcttatta gaaaacaaac tgcttgaccc aggaacaagt 

9 ?£/n agCtt aa ^ taaactt ggctttgctc agatccctga tccttccagc tggtctgctc 
t 9 a nn ggCtt atcccgcatg agcaggagcg tgctggccct gagtactgaa ctttctgagt 
aacaatgaga cacgttacag aacctatgtt caggttgcgg gtgagctgcc ctctccaaat 

C ?o?n Ca9a9 at 9 cacat tc ctcggccagt ctcagccaac agtaccaaaa gtgatttttg 
282 0 

a oo?n 9CCag g ^ taaa ^ ct tccagttcag cctcagttat tttagacaat ctcgccatct 
2880 

ttaatttctt agcttcctgt tctaataaat gcacggcttt acctttcctg tcagaaataa 
2940 

accaaggctc taaaagatga tttcccttct gtaactccct agagccacag gttctcattc
3000

C ^n C ° Cat tatacttctc acaattcagt ttctatgagt ttgatcacct gattttttta 
3060

acaaaatatt tctaacggga atgggtggga gtgctggtga aaagagatga aatgtggttg
3120

tatgagccaa tcatatttgt gattttttaa aaaaagttta aaaggaaata tctgttctga 
3180 

aaccccactt aagcattgtt tttatataaa aacaatgata aagatgtgaa ctgtgaaata 
3240

aatataccat attagctacc caccaaaaaa aaaaaaaaaa aa 
3282 

<210> 54 

<211> 270 

<212> DNA

<213> Homo sapiens 

<400> 54 

ggtccattcg ggaattactg cccagcagcc gactaagttg cattccttga atcttcgcag
60 

aaaagacaat tcttttaatc agagttagta atgtggacag tacaaaatcg agagagtctg
120 

gggcttctct ctttccctgt gatgattacc atggtctgtt gtgcacacag gtgagctgct 
180 
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gt 240 tgaat Ctcgctctct ctc tctcttt ttttcttggt atgtttcttt ttacgtgtct 

gctggatcat gtatcttgtt gtttgggggt 
270 

<210> 55 

<211> 238 

<212> DNA 

<213> Homo sapiens 

<400> 55 

atggctatac cactgatgac attgaatttt actggaatgg aggagaaggg gcagtcactg 
60

gtgttaataa aatcgaactt cctcaatttt caattgttga ctacaagatg gtgtctaaga 
120

aggtggagtt cacaacaggt gaggttgttt cccccaaaat gtactagggg tgctgtgaaa
180

ggaagaagat ggttccaacc aaataatggg ctgattactt gtcttttgtt tctcaact 
238

<210> 56 

<211> 345

<212> DNA 

<213> Homo sapiens 

<400> 56 

gcaccaataa ggaagtggca aatggcatct gtcctctcaa tcttgaaaaa ggaacttaat 
60 

agtggcgcct tcagctaagt gttgtctttc tctttcacag gaatcacgac agtgcttaca 
120

atgacaacca tcagcaccca cctcagggag accctgccaa agatccctta tgtcaaagcg
180

attgatattt atctgatggg ttgctttgtg tttgtgttcc tggctctgct ggagtatgcc
240 

tttgtaaatt acatcttctt tgggaaaggc cctcagaaaa agggagctag caaacaagac 
300 

cagagtgcca atgagaagaa taaactggag atgaataaag tccag 
345



<210> 57 

<211> 190 

<212> DNA 

<213> Homo sapiens

<400> 57

gggtgggccg gcgggcggcg ggcagggcgc ggggtgcgcg gggcgctggc ggctgagccg 
60 

cccctgaccc cgctctttgt gctccctgtc cctcccccag tgtgaacgat cccgggaaca 
120 

tgtcctttgt gaaggagacg gtggacaagc tgttgaaagg ctacgacatt cgcctaagac 
180 

ccgacttcgg 



190 




<210> 58

<211> 253

<212> DNA

<213> Homo

<400> 58

gtgcctatcc
60





ttcagactta tatgccctct atactgataa cgattctgtc gtgggtgtcc ttctggatca 
120 

attatgatgc atctgctgct agagttgccc tcggtatgtg ctatttttaa gtgatattta 
180 

aatgtaaagt aaccgtatca ttacagtatt gagagttcaa aggctgtagt tcaactacca 
240 

ttttttgaca gcg 
253 

<210> 59 

<211> 288 

<212> DNA 

<213> Homo sapiens 

<400> 59 

acggttactc atcggaggac atcgtctact actggtcgga gagccaggag cacatccacg 
60 

ggctggacaa gctgcagctg gcgcagttca ccatcaccag ctaccgcttc accacggagc 
120 

tgatgaactt caagtccggt aacatatgcc cgccgcccct tccgcatgtg cccgccgccc
180 

cttccgcgcg cgcccaccgc cccttccgcg cgcgcccacc gccccttccg cgtgcgcccg 
240 



cctgtggttt tcatgctttt tagtcaagcc gcccgcaggc ccccaggg
288

<210> 60 

<211> 288 

<212> DNA 

<213> Homo sapiens 

<400> 60 

acggttactc atcggaggac atcgtctact actggtcgga gagccaggag cacatccacg
60

ggctggacaa gctgcagctg gcgcagttca ccatcaccag ctaccgcttc accacggagc
120

tgatgaactt caagtccggt aacatatgcc cgccgcccct tccgcatgtg cccgccgccc
180

cttccgcgcg cgcccaccgc cccttccgcg cgcgcccacc gccccttccg cgtgcgcccg
240

cctgtggttt tcatgctttt tagtcaacgc gcccgcaggc ccccaggg 
288 

<210> 61 

<211> 288 

<212> DNA 

<213> Homo sapiens 

<400> 61 

acggttactc atcggaggac atcgtctact actggtcgga gagccaggag cacatccacg 
60 

ggctggacaa gctgcagctg gcgcagttca ccatcaccag ctaccgcttc accacggagc
120

tgatgaactt caagtccggt aacatatgcc cgccgcccct tccgcatgtg cccgccgccc
180

cttccgcgcg cgcccaccgc cccttccgcg tgcgcccgcc tgtggttttc atgcttttta 
24 0 

gtcaagcgcc cgcaggcccc cagggcctct ggggatgcag ctgggacg 
288 



<210> 62 

<211> 170 

<212> DNA 

<213> Homo sapiens 

<400> 62 

accggtgatg tggcttggtt tagtcatacc ctaaagattg ctcttaagag tgatcttgga
60

tgcaaatgtt catgacagtt tcctagttat tttttcttct tttcttgtag ttactacatc 
120 

cagattcctc aagatggatt cctgagcgaa taagcctaca agccccttcc 
170 



<210> 63 

<211> 218 

<212> PRT 

<213> Homo sapiens 

<400> 63 

Met Gly Arg Leu Leu Ala Leu Val Val Gly Ala Ala Leu Val Ser Ser 
1 5 10 15 

Ala Cys Gly Gly Cys Val Glu Val Asp Ser Glu Thr Glu Ala Val Tyr
20 25 30

Gly Met Thr Phe Lys Ile Leu Cys Ile Ser Cys Lys Arg Arg Ser Glu
35 40 45

Thr Asn Ala Glu Thr Phe Thr Glu Trp Thr Phe Arg Gln Lys Gly Thr
50 55 60

Glu Glu Phe Val Lys Ile Leu Arg Tyr Glu Asn Glu Val Leu Gln Leu
65 70 75 80

Glu Glu Asp Glu His Phe Glu Gly Arg Val Val Trp Asn Gly Ser Arg
85 90 95

Gly Thr Lys Asp Leu Gln Asp Leu Ser Ile Phe Ile Thr Asn Val Thr
100 105 110

Tyr Asn His Ser Gly Asp Tyr Glu Cys His Val Tyr Arg Leu Leu Phe
115 120 125

Phe Glu Asn Tyr Glu His Asn Thr Ser Val Val Lys Lys Ile His Ile
130 135 140

Glu Val Val Asp Lys Ala Asn Arg Asp Met Ala Ser Ile Val Ser Glu
145 150 155 160

Ile Met Met Tyr Val Leu Ile Val Val Leu Thr Ile Trp Leu Val Ala
165 170 175

Glu Met Ile Tyr Cys Tyr Lys Lys Ile Ala Ala Ala Thr Glu Thr Ala
180 185 190

Ala Gln Glu Asn Ala Ser Glu Tyr Leu Ala Ile Thr Ser Glu Ser Lys
195 200 205

Glu Asn Cys Thr Gly Val Gln Val Ala Glu
210 215



<210> 64 

<211> 2005 

<212> PRT 

<213> Homo sapiens

<400> 64 

Met Ala Gln Ser Val Leu Val Pro Pro Gly Pro Asp Ser Phe Arg Phe
1 5 10 15

Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Gln Arg Ile Ala Glu Glu
20 25 30

Lys Ala Lys Arg Pro Lys Gln Glu Arg Lys Asp Glu Asp Asp Glu Asn
35 40 45

Gly Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Ser Leu Pro Phe
50 55 60

Ile Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Val Pro Leu Glu Asp
65 70 75 80

Leu Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys
85 90 95

Gly Lys Ala Ile Ser Arg Phe Ser Ala Thr Pro Ala Leu Tyr Ile Leu
100 105 110

Thr Pro Phe Asn Pro Ile Arg Lys Leu Ala Ile Lys Ile Leu Val His
115 120 125

Ser Leu Phe Asn Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val
130 135 140

Phe Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr
145 150 155 160

Thr Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Leu Ala
165 170 175

Arg Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn
180 185 190

Trp Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val
195 200 205

Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Gln Ala
210 215 220

Leu Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala
225 230 235 240

Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val
245 250 255

Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly
260 265 270

Asn Leu Arg Asn Lys Cys Leu Gln Trp Pro Pro Asp Asn Ser Ser Phe
275 280 285

Glu Ile Asn Ile Thr Ser Phe Phe Asn Asn Ser Leu Asp Gly Asn Gly
290 295 300

Thr Thr Phe Asn Arg Thr Val Ser Ile Phe Asn Trp Asp Glu Tyr Ile
305 310 315 320

Glu Asp Lys Ser His Phe Tyr Phe Leu Glu Gly Gln Asn Asp Ala Leu
325 330 335

Leu Cys Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Ile
340 345 350






Cys Val Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp
355 360 365

Thr Phe Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp
370 375 380

Phe Trp Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr
385 390 395 400

Tyr Met Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu
405 410 415

Ile Asn Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn
420 425 430

Gln Ala Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln
435 440 445

Met Leu Glu Gln Leu Lys Lys Gln Gln Glu Glu Ala Gln Ala Ala Ala
450 455 460

Ala Ala Ala Ser Ala Glu Ser Arg Asp Phe Ser Gly Ala Gly Gly Ile
465 470 475 480

Gly Val Phe Ser Glu Ser Ser Ser Val Ala Ser Lys Leu Ser Ser Lys
485 490 495

Ser Glu Lys Glu Leu Lys Asn Arg Arg Lys Lys Lys Lys Gln Lys Glu
500 505 510

Gln Ser Gly Glu Glu Glu Lys Asn Asp Arg Val Leu Lys Ser Glu Ser
515 520 525

Glu Asp Ser Ile Arg Arg Lys Gly Phe Arg Phe Ser Leu Glu Gly Ser
530 535 540

Arg Leu Thr Tyr Glu Lys Arg Phe Ser Ser Pro His Gln Ser Leu Leu
545 550 555 560

Ser Ile Arg Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Ala Ser
565 570 575

Leu Phe Ser Phe Arg Gly Arg Ala Lys Asp Ile Gly Ser Glu Asn Asp
580 585 590

Phe Ala Asp Asp Glu His Ser Thr Phe Glu Asp Asn Asp Ser Arg Arg
595 600 605

Asp Ser Leu Phe Val Pro His Arg His Gly Glu Arg Arg His Ser Asn
610 615 620

Val Ser Gln Ala Ser Arg Ala Ser Arg Val Leu Pro Ile Leu Pro Met
625 630 635 640

Asn Gly Lys Met His Ser Ala Val Asp Cys Asn Gly Val Val Ser Leu
645 650 655

Val Gly Gly Pro Ser Thr Leu Thr Ser Ala Gly Gln Leu Leu Pro Glu
660 665 670

Gly Thr Thr Thr Glu Thr Glu Ile Arg Lys Arg Arg Ser Ser Ser Tyr
675 680 685

His Val Ser Met Asp Leu Leu Glu Asp Pro Thr Ser Arg Gln Arg Ala
690 695 700

Met Ser Ile Ala Ser Ile Leu Thr Asn Thr Met Glu Glu Leu Glu Glu
705 710 715 720

Ser Arg Gln Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala Asn Met Cys
725 730 735

Leu Ile Trp Asp Cys Cys Lys Pro Trp Leu Lys Val Lys His Leu Val
740 745 750

Asn Leu Val Val Met Asp Pro Phe Val Asp Leu Ala Ile Thr Ile Cys
755 760 765

Ile Val Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr
770 775 780

Glu Gln Phe Ser Ser Val Leu Ser Val Gly Asn Leu Val Phe Thr Gly
785 790 795 800

Ile Phe Thr Ala Glu Met Phe Leu Lys Ile Ile Ala Met Asp Pro Tyr
805 810 815

Tyr Tyr Phe Gln Glu Gly Trp Asn Ile Phe Asp Gly Phe Ile Val Ser
820 825 830

Leu Ser Leu Met Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val
835 840 845

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp
850 855 860

Pro Thr Leu Asn Met Leu Ile Lys Ile Ile Gly Asn Ser Val Gly Ala
865 870 875 880

Leu Gly Asn Leu Thr Leu Val Leu Ala Ile Ile Val Phe Ile Phe Ala
885 890 895

Val Val Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys
900 905 910

Lys Ile Ser Asn Asp Cys Glu Leu Pro Arg Trp His Met His Asp Phe
915 920 925

Phe His Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp Ile
930 935 940

Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gln Thr Met Cys Leu
945 950 955 960

Thr Val Phe Met Met Val Met Val Ile Gly Asn Leu Val Val Leu Asn
965 970 975

Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ser Asp Asn Leu Ala
980 985 990

Ala Thr Asp Asp Asp Asn Glu Met Asn Asn Leu Gln Ile Ala Val Gly
995 1000 1005

Arg Met Gln Lys Gly Ile Asp Phe Val Lys Arg Lys Ile Arg Glu
1010 1015 1020

Phe Ile Gln Lys Ala Phe Val Arg Lys Gln Lys Ala Leu Asp Glu
1025 1030 1035

Ile Lys Pro Leu Glu Asp Leu Asn Asn Lys Lys Asp Ser Cys Ile
1040 1045 1050

Ser Asn His Thr Thr Ile Glu Ile Gly Lys Asp Leu Asn Tyr Leu
1055 1060 1065

Lys Asp Gly Asn Gly Thr Thr Ser Gly Ile Gly Ser Ser Val Glu
1070 1075 1080

Lys Tyr Val Val Asp Glu Ser Asp Tyr Met Ser Phe Ile Asn Asn
1085 1090 1095

Pro Ser Leu Thr Val Thr Val Pro Ile Ala Val Gly Glu Ser Asp
1100 1105 1110

Phe Glu Asn Leu Asn Thr Glu Glu Phe Ser Ser Glu Ser Asp Met 
1115 1120 1125 

Glu Glu Ser Lys Glu Lys Leu Asn Ala Thr Ser Ser Ser Glu Gly 
1130 1135 1140 

Ser Thr Val Asp Ile Gly Ala Pro Ala Glu Gly Glu Gln Pro Glu
1145 1150 1155



Val Glu Pro Glu Glu Ser Leu Glu Pro Glu Ala Cys Phe Thr Glu 
1160 1165 1170 



Asp Cys Val Arg Lys Phe Lys Cys Cys Gln Ile Ser Ile Glu Glu
1175 1180 1185

Gly Lys Gly Lys Leu Trp Trp Asn Leu Arg Lys Thr Cys Tyr Lys
1190 1195 1200

Ile Val Glu His Asn Trp Phe Glu Thr Phe Ile Val Phe Met Ile
1205 1210 1215

Leu Leu Ser Ser Gly Ala Leu Ala Phe Glu Asp Ile Tyr Ile Glu
1220 1225 1230

Gln Arg Lys Thr Ile Lys Thr Met Leu Glu Tyr Ala Asp Lys Val
1235 1240 1245

Phe Thr Tyr Ile Phe Ile Leu Glu Met Leu Leu Lys Trp Val Ala
1250 1255 1260

Tyr Gly Phe Gln Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp
1265 1270 1275

Phe Leu Ile Val Asp Val Ser Leu Val Ser Leu Thr Ala Asn Ala
1280 1285 1290

Leu Gly Tyr Ser Glu Leu Gly Ala Ile Lys Ser Leu Arg Thr Leu
1295 1300 1305



Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Arg Phe Glu Gly Met 
1310 1315 1320 

Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile Pro Ser Ile Met
1325 1330 1335

Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile Phe Ser Ile
1340 1345 1350

Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys Ile Asn
1355 1360 1365

Tyr Thr Thr Gly Glu Met Phe Asp Val Ser Val Val Asn Asn Tyr 
1370 1375 1380 



Ser Glu Cys Lys Ala Leu Ile Glu Ser Asn Gln Thr Ala Arg Trp
1385 1390 1395



Lys Asn Val Lys Val Asn Phe Asp Asn Val Gly Leu Gly Tyr Leu 
1400 1405 1410 



Ser Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met
1415 1420 1425

Tyr Ala Ala Val Asp Ser Arg Asn Val Glu Leu Gln Pro Lys Tyr
1430 1435 1440

Glu Asp Asn Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile
1445 1450 1455

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile
1460 1465 1470

Asp Asn Phe Asn Gln Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile
1475 1480 1485

Phe Met Thr Glu Glu Gln Lys Lys Tyr Tyr Asn Ala Met Lys Lys
1490 1495 1500

Leu Gly Ser Lys Lys Pro Gln Lys Pro Ile Pro Arg Pro Ala Asn
1505 1510 1515

Lys Phe Gln Gly Met Val Phe Asp Phe Val Thr Lys Gln Val Phe
1520 1525 1530

Asp Ile Ser Ile Met Ile Leu Ile Cys Leu Asn Met Val Thr Met
1535 1540 1545

Met Val Glu Thr Asp Asp Gln Ser Gln Glu Met Thr Asn Ile Leu
1550 1555 1560

Tyr Trp Ile Asn Leu Val Phe Ile Val Leu Phe Thr Gly Glu Cys
1565 1570 1575

Val Leu Lys Leu Ile Ser Leu Arg Tyr Tyr Tyr Phe Thr Ile Gly
1580 1585 1590

Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile Val Gly
1595 1600 1605
Met Phe Leu Ala Glu Leu Ile Glu Lys Tyr Phe Val Ser Pro Thr
1610 1615 1620

Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile Leu Arg
1625 1630 1635

Leu Ile Lys Gly Ala Lys Gly Ile Arg Thr Leu Leu Phe Ala Leu
1640 1645 1650

Met Met Ser Leu Pro Ala Leu Phe Asn Ile Gly Leu Leu Leu Phe
1655 1660 1665

Leu Val Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala
1670 1675 1680

Tyr Val Lys Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu
1685 1690 1695

Thr Phe Gly Asn Ser Met Ile Cys Leu Phe Gln Ile Thr Thr Ser
1700 1705 1710

Ala Gly Trp Asp Gly Leu Leu Ala Pro Ile Leu Asn Ser Gly Pro
1715 1720 1725

Pro Asp Cys Asp Pro Asp Lys Asp His Pro Gly Ser Ser Val Lys
1730 1735 1740

Gly Asp Cys Gly Asn Pro Ser Val Gly Ile Phe Phe Phe Val Ser
1745 1750 1755

Tyr Ile Ile Ile Ser Phe Leu Val Val Leu Asn Met Tyr Ile Ala
1760 1765 1770

Val Ile Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala Glu
1775 1780 1785

Pro Leu Ser Glu Asp Asp Phe Glu Met Phe Tyr Glu Val Trp Glu
1790 1795 1800

Lys Phe Asp Pro Asp Ala Thr Gln Phe Ile Glu Phe Ala Lys Leu
1805 1810 1815

Ser Asp Phe Ala Asp Ala Leu Asp Pro Pro Leu Leu Ile Ala Lys
1820 1825 1830




Pro Asn Lys Val Gln Leu Ile Ala Met Asp Leu Pro Met Val Ser
1835 1840 1845

Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr Lys
1850 1855 1860

Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg Ile Gln
1865 1870 1875

Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr
1880 1885 1890

Glu Pro Ile Thr Thr Thr Leu Lys Arg Lys Gln Glu Glu Val Ser
1895 1900 1905

Ala Ile Ile Ile Gln Arg Ala Tyr Arg Arg Tyr Leu Leu Lys Gln
1910 1915 1920

Lys Val Lys Lys Val Ser Ser Ile Tyr Lys Lys Asp Lys Gly Lys
1925 1930 1935

Glu Cys Asp Gly Thr Pro Ile Lys Glu Asp Thr Leu Ile Asp Lys
1940 1945 1950



Leu Asn Glu Asn Ser Thr Pro Glu Lys Thr Asp Met Thr Pro Ser 
1955 1960 1965 



Thr Thr Ser Pro Pro Ser Tyr Asp Ser Val Thr Lys Pro Glu Lys 
1970 1975 1980 



Glu Lys Phe Glu Lys Asp Lys Ser Glu Lys Glu Asp Lys Gly Lys 
1985 1990 1995 



Asp Ile Arg Glu Ser Lys Lys
2000 2005 



<210> 65 

<211> 2005 

<212> PRT 

<213> Homo sapiens 



<400> 65

Met Ala Gln Ser Val Leu Val Pro Pro Gly Pro Asp Ser Phe Arg Phe
1 5 10 15

Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Gln Arg Ile Ala Glu Glu
20 25 30

Lys Ala Lys Arg Pro Lys Gln Glu Arg Lys Asp Glu Asp Asp Glu Asn
35 40 45

Gly Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Ser Leu Pro Phe
50 55 60

Ile Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Val Pro Leu Glu Asp
65 70 75 80

Leu Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys
85 90 95

Gly Lys Ala Ile Ser Arg Phe Ser Ala Thr Pro Ala Leu Tyr Ile Leu
100 105 110

Thr Pro Phe Asn Pro Ile Arg Lys Leu Ala Ile Lys Ile Leu Val His
115 120 125

Ser Leu Phe Asn Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val
130 135 140

Phe Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr
145 150 155 160

Thr Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Leu Ala
165 170 175

Arg Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn
180 185 190

Trp Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val
195 200 205

Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala
210 215 220

Leu Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala
225 230 235 240

Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val
245 250 255

Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly
260 265 270

Asn Leu Arg Asn Lys Cys Leu Gln Trp Pro Pro Asp Asn Ser Ser Phe
275 280 285

Glu Ile Asn Ile Thr Ser Phe Phe Asn Asn Ser Leu Asp Gly Asn Gly
290 295 300

Thr Thr Phe Asn Arg Thr Val Ser Ile Phe Asn Trp Asp Glu Tyr Ile
305 310 315 320

Glu Asp Lys Ser His Phe Tyr Phe Leu Glu Gly Gln Asn Asp Ala Leu
325 330 335

Leu Cys Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Ile
340 345 350

Cys Val Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp
355 360 365

Thr Phe Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp
370 375 380

Phe Trp Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr
385 390 395 400

Tyr Met Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu
405 410 415

Ile Asn Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn
420 425 430

Gln Ala Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln
435 440 445

Met Leu Glu Gln Leu Lys Lys Gln Gln Glu Glu Ala Gln Ala Ala Ala
450 455 460

Ala Ala Ala Ser Ala Glu Ser Arg Asp Phe Ser Gly Ala Gly Gly Ile
465 470 475 480



Gly Val Phe Ser Glu Ser Ser Ser Val Ala Ser Lys Leu Ser Ser Lys 

485 490 495 



Ser Glu Lys Glu Leu Lys Asn Arg Arg Lys Lys Lys Lys Gln Lys Glu
500 505 510

Gln Ser Gly Glu Glu Glu Lys Asn Asp Arg Val Leu Lys Ser Glu Ser
515 520 525

Glu Asp Ser Ile Arg Arg Lys Gly Phe Arg Phe Ser Leu Glu Gly Ser
530 535 540

Arg Leu Thr Tyr Glu Lys Arg Phe Ser Ser Pro His Gln Ser Leu Leu
545 550 555 560

Ser Ile Arg Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Ala Ser
565 570 575

Leu Phe Ser Phe Arg Gly Arg Ala Lys Asp Ile Gly Ser Glu Asn Asp
580 585 590

Phe Ala Asp Asp Glu His Ser Thr Phe Glu Asp Asn Asp Ser Arg Arg
595 600 605

Asp Ser Leu Phe Val Pro His Arg His Gly Glu Arg Arg His Ser Asn
610 615 620

Val Ser Gln Ala Ser Arg Ala Ser Arg Val Leu Pro Ile Leu Pro Met
625 630 635 640

Asn Gly Lys Met His Ser Ala Val Asp Cys Asn Gly Val Val Ser Leu
645 650 655

Val Gly Gly Pro Ser Thr Leu Thr Ser Ala Gly Gln Leu Leu Pro Glu
660 665 670



Gly Thr Thr Thr Glu Thr Glu Ile Arg Lys Arg Arg Ser Ser Ser Tyr
675 680 685

His Val Ser Met Asp Leu Leu Glu Asp Pro Thr Ser Arg Gln Arg Ala
690 695 700

Met Ser Ile Ala Ser Ile Leu Thr Asn Thr Met Glu Glu Leu Glu Glu
705 710 715 720

Ser Arg Gln Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala Asn Met Cys
725 730 735

Leu Ile Trp Asp Cys Cys Lys Pro Trp Leu Lys Val Lys His Leu Val
740 745 750

Asn Leu Val Val Met Asp Pro Phe Val Asp Leu Ala Ile Thr Ile Cys
755 760 765

Ile Val Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr
770 775 780

Glu Gln Phe Ser Ser Val Leu Ser Val Gly Asn Leu Val Phe Thr Gly
785 790 795 800

Ile Phe Thr Ala Glu Met Phe Leu Lys Ile Ile Ala Met Asp Pro Tyr
805 810 815

Tyr Tyr Phe Gln Glu Gly Trp Asn Ile Phe Asp Gly Phe Ile Val Ser
820 825 830

Leu Ser Leu Met Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val
835 840 845

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp
850 855 860

Pro Thr Leu Asn Met Leu Ile Lys Ile Ile Gly Asn Ser Val Gly Ala
865 870 875 880

Leu Gly Asn Leu Thr Leu Val Leu Ala Ile Ile Ile Phe Ile Phe Ala
885 890 895

Val Val Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys
900 905 910

Lys Ile Ser Asn Asp Cys Glu Leu Pro Arg Trp His Met His Asp Phe
915 920 925

Phe His Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp Ile
930 935 940

Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gln Thr Met Cys Leu
945 950 955 960

Thr Val Phe Met Met Val Met Val Ile Gly Asn Leu Val Val Leu Asn
965 970 975

Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ser Asp Asn Leu Ala 

980 985 990 

Ala Thr Asp Asp Asp Asn Glu Met Asn Asn Leu Gln Ile Ala Val Gly
995 1000 1005

Arg Met Gln Lys Gly Ile Asp Phe Val Lys Arg Lys Ile Arg Glu
1010 1015 1020

Phe Ile Gln Lys Ala Phe Val Arg Lys Gln Lys Ala Leu Asp Glu
1025 1030 1035

Ile Lys Pro Leu Glu Asp Leu Asn Asn Lys Lys Asp Ser Cys Ile
1040 1045 1050

Ser Asn His Thr Thr Ile Glu Ile Gly Lys Asp Leu Asn Tyr Leu
1055 1060 1065

Lys Asp Gly Asn Gly Thr Thr Ser Gly Ile Gly Ser Ser Val Glu
1070 1075 1080

Lys Tyr Val Val Asp Glu Ser Asp Tyr Met Ser Phe Ile Asn Asn
1085 1090 1095

Pro Ser Leu Thr Val Thr Val Pro Ile Ala Val Gly Glu Ser Asp
1100 1105 1110

Phe Glu Asn Leu Asn Thr Glu Glu Phe Ser Ser Glu Ser Asp Met 
1115 1120 1125 

Glu Glu Ser Lys Glu Lys Leu Asn Ala Thr Ser Ser Ser Glu Gly
1130 1135 1140

Ser Thr Val Asp Ile Gly Ala Pro Ala Glu Gly Glu Gln Pro Glu
1145 1150 1155

Val Glu Pro Glu Glu Ser Leu Glu Pro Glu Ala Cys Phe Thr Glu 
1160 1165 1170 

Asp Cys Val Arg Lys Phe Lys Cys Cys Gln Ile Ser Ile Glu Glu
1175 1180 1185

Gly Lys Gly Lys Leu Trp Trp Asn Leu Arg Lys Thr Cys Tyr Lys 
1190 1195 1200 

Ile Val Glu His Asn Trp Phe Glu Thr Phe Ile Val Phe Met Ile
1205 1210 1215

Leu Leu Ser Ser Gly Ala Leu Ala Phe Glu Asp Ile Tyr Ile Glu
1220 1225 1230

Gln Arg Lys Thr Ile Lys Thr Met Leu Glu Tyr Ala Asp Lys Val
1235 1240 1245

Phe Thr Tyr Ile Phe Ile Leu Glu Met Leu Leu Lys Trp Val Ala
1250 1255 1260

Tyr Gly Phe Gln Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp
1265 1270 1275

Phe Leu Ile Val Asp Val Ser Leu Val Ser Leu Thr Ala Asn Ala
1280 1285 1290

Leu Gly Tyr Ser Glu Leu Gly Ala Ile Lys Ser Leu Arg Thr Leu
1295 1300 1305

Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Arg Phe Glu Gly Met
1310 1315 1320

Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile Pro Ser Ile Met
1325 1330 1335

Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile Phe Ser Ile
1340 1345 1350

Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys Ile Asn
1355 1360 1365

Tyr Thr Thr Gly Glu Met Phe Asp Val Ser Val Val Asn Asn Tyr
1370 1375 1380

Ser Glu Cys Lys Ala Leu Ile Glu Ser Asn Gln Thr Ala Arg Trp
1385 1390 1395

Lys Asn Val Lys Val Asn Phe Asp Asn Val Gly Leu Gly Tyr Leu
1400 1405 1410

Ser Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met
1415 1420 1425

Tyr Ala Ala Val Asp Ser Arg Asn Val Glu Leu Gln Pro Lys Tyr
1430 1435 1440

Glu Asp Asn Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile
1445 1450 1455

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile
1460 1465 1470

Asp Asn Phe Asn Gln Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile
1475 1480 1485

Phe Met Thr Glu Glu Gln Lys Lys Tyr Tyr Asn Ala Met Lys Lys
1490 1495 1500

Leu Gly Ser Lys Lys Pro Gln Lys Pro Ile Pro Arg Pro Ala Asn
1505 1510 1515

Lys Phe Gln Gly Met Val Phe Asp Phe Val Thr Lys Gln Val Phe
1520 1525 1530

Asp Ile Ser Ile Met Ile Leu Ile Cys Leu Asn Met Val Thr Met
1535 1540 1545

Met Val Glu Thr Asp Asp Gln Ser Gln Glu Met Thr Asn Ile Leu
1550 1555 1560

Tyr Trp Ile Asn Leu Val Phe Ile Val Leu Phe Thr Gly Glu Cys
1565 1570 1575

Val Leu Lys Leu Ile Ser Leu Arg Tyr Tyr Tyr Phe Thr Ile Gly 
1580 1585 1590 

Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile Val Gly
1595 1600 1605

Met Phe Leu Ala Glu Leu Ile Glu Lys Tyr Phe Val Ser Pro Thr 
1610 1615 1620 

Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile Leu Arg
1625 1630 1635

Leu Ile Lys Gly Ala Lys Gly Ile Arg Thr Leu Leu Phe Ala Leu
1640 1645 1650

Met Met Ser Leu Pro Ala Leu Phe Asn Ile Gly Leu Leu Leu Phe
1655 1660 1665

Leu Val Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala
1670 1675 1680

Tyr Val Lys Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu
1685 1690 1695

Thr Phe Gly Asn Ser Met Ile Cys Leu Phe Gln Ile Thr Thr Ser
1700 1705 1710




Ala Gly Trp Asp Gly Leu Leu Ala Pro Ile Leu Asn Ser Gly Pro
1715 1720 1725



Pro Asp Cys Asp Pro Asp Lys Asp His Pro Gly Ser Ser Val Lys 
1730 1735 1740 



Gly Asp Cys Gly Asn Pro Ser Val Gly Ile Phe Phe Phe Val Ser
1745 1750 1755

Tyr Ile Ile Ile Ser Phe Leu Val Val Leu Asn Met Tyr Ile Ala
1760 1765 1770

Val Ile Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala Glu
1775 1780 1785



Pro Leu Ser Glu Asp Asp Phe Glu Met Phe Tyr Glu Val Trp Glu 
1790 1795 1800 



Lys Phe Asp Pro Asp Ala Thr Gln Phe Ile Glu Phe Ala Lys Leu
1805 1810 1815

Ser Asp Phe Ala Asp Ala Leu Asp Pro Pro Leu Leu Ile Ala Lys
1820 1825 1830

Pro Asn Lys Val Gln Leu Ile Ala Met Asp Leu Pro Met Val Ser
1835 1840 1845

Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr Lys
1850 1855 1860

Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg Ile Gln
1865 1870 1875



Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr 
1880 1885 1890 



Glu Pro Ile Thr Thr Thr Leu Lys Arg Lys Gln Glu Glu Val Ser
1895 1900 1905

Ala Ile Ile Ile Gln Arg Ala Tyr Arg Arg Tyr Leu Leu Lys Gln
1910 1915 1920

Lys Val Lys Lys Val Ser Ser Ile Tyr Lys Lys Asp Lys Gly Lys
1925 1930 1935

Glu Cys Asp Gly Thr Pro Ile Lys Glu Asp Thr Leu Ile Asp Lys
1940 1945 1950

Leu Asn Glu Asn Ser Thr Pro Glu Lys Thr Asp Met Thr Pro Ser
1955 1960 1965

Thr Thr Ser Pro Pro Ser Tyr Asp Ser Val Thr Lys Pro Glu Lys 
1970 1975 1980 

Glu Lys Phe Glu Lys Asp Lys Ser Glu Lys Glu Asp Lys Gly Lys 
1985 1990 1995 

Asp Ile Arg Glu Ser Lys Lys
2000 2005 

<210> 66 

<211> 2005 

<212> PRT 

<213> Homo sapiens 

<400> 66 

Met Ala Gln Ser Val Leu Val Pro Pro Gly Pro Asp Ser Phe Arg Phe
1 5 10 15

Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Gln Arg Ile Ala Glu Glu
20 25 30

Lys Ala Lys Arg Pro Lys Gln Glu Arg Lys Asp Glu Asp Asp Glu Asn
35 40 45

Gly Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Ser Leu Pro Phe
50 55 60

Ile Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Val Pro Leu Glu Asp
65 70 75 80

Leu Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys
85 90 95

Gly Lys Ala Ile Ser Arg Phe Ser Ala Thr Pro Ala Leu Tyr Ile Leu
100 105 110



Thr Pro Phe Asn Pro Ile Arg Lys Leu Ala Ile Lys Ile Leu Val His
115 120 125

Ser Leu Phe Asn Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val
130 135 140

Phe Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr
145 150 155 160

Thr Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Leu Ala
165 170 175

Arg Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn 

180 185 190 

Trp Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val
195 200 205

Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala 
210 215 220 

Leu Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala
225 230 235 240

Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val
245 250 255

Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly
260 265 270

Asn Leu Arg Asn Lys Cys Leu Gln Trp Pro Pro Asp Asn Ser Ser Phe
275 280 285

Glu Ile Asn Ile Thr Ser Phe Phe Asn Asn Ser Leu Asp Gly Asn Gly
290 295 300

Thr Thr Phe Asn Arg Thr Val Ser Ile Phe Asn Trp Asp Glu Tyr Ile
305 310 315 320

Glu Asp Lys Ser His Phe Tyr Phe Leu Glu Gly Gln Asn Asp Ala Leu
325 330 335

Leu Cys Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Ile
340 345 350

Cys Val Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp
355 360 365

Thr Phe Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp
370 375 380

Phe Trp Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr
385 390 395 400

Tyr Met Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu
405 410 415

Ile Asn Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn
420 425 430

Gln Ala Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln
435 440 445

Met Leu Glu Gln Leu Lys Lys Gln Gln Glu Glu Ala Gln Ala Ala Ala
450 455 460

Ala Ala Ala Ser Ala Glu Ser Arg Asp Phe Ser Gly Ala Gly Gly Ile
465 470 475 480

Gly Val Phe Ser Glu Ser Ser Ser Val Ala Ser Lys Leu Ser Ser Lys 

485 490 495 



Ser Glu Lys Glu Leu Lys Asn Arg Arg Lys Lys Lys Lys Gln Lys Glu
500 505 510

Gln Ser Gly Glu Glu Glu Lys Asn Asp Arg Val Leu Lys Ser Glu Ser
515 520 525

Glu Asp Ser Ile Arg Arg Lys Gly Phe Arg Phe Ser Leu Glu Gly Ser
530 535 540

Arg Leu Thr Tyr Glu Lys Arg Phe Ser Ser Pro His Gln Ser Leu Leu
545 550 555 560

Ser Ile Arg Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Ala Ser
565 570 575

Leu Phe Ser Phe Arg Gly Arg Ala Lys Asp Ile Gly Ser Glu Asn Asp
580 585 590

Phe Ala Asp Asp Glu His Ser Thr Phe Glu Asp Asn Asp Ser Arg Arg 
595 600 605 

Asp Ser Leu Phe Val Pro His Arg His Gly Glu Arg Arg His Ser Asn 
610 615 620 

Val Ser Gln Ala Ser Arg Ala Ser Arg Val Leu Pro Ile Leu Pro Met
625 630 635 640

Asn Gly Lys Met His Ser Ala Val Asp Cys Asn Gly Val Val Ser Leu
645 650 655

Val Gly Gly Pro Ser Thr Leu Thr Ser Ala Gly Gln Leu Leu Pro Glu
660 665 670

Gly Thr Thr Thr Glu Thr Glu Ile Arg Lys Arg Arg Ser Ser Ser Tyr
675 680 685

His Val Ser Met Asp Leu Leu Glu Asp Pro Thr Ser Arg Gln Arg Ala
690 695 700

Met Ser Ile Ala Ser Ile Leu Thr Asn Thr Met Glu Glu Leu Glu Glu
705 710 715 720

Ser Arg Gln Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala Asn Met Cys
725 730 735

Leu Ile Trp Asp Cys Cys Lys Pro Trp Leu Lys Val Lys His Leu Val
740 745 750

Asn Leu Val Val Met Asp Pro Phe Val Asp Leu Ala Ile Thr Ile Cys
755 760 765

Ile Val Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr
770 775 780

Glu Gln Phe Ser Ser Val Leu Ser Val Gly Asn Leu Val Phe Thr Gly
785 790 795 800

Ile Phe Thr Ala Glu Met Phe Leu Lys Ile Ile Ala Met Asp Pro Tyr
805 810 815

Tyr Tyr Phe Gln Glu Gly Trp Asn Ile Phe Asp Gly Phe Ile Val Ser
820 825 830

Leu Ser Leu Met Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val 
835 840 845 

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp 
850 855 860 

Pro Thr Leu Asn Met Leu Ile Lys Ile Ile Gly Asn Ser Val Gly Ala
865 870 875 880

Leu Gly Asn Leu Thr Leu Val Leu Ala Ile Ile Val Phe Ile Phe Ala
885 890 895

Val Val Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys
900 905 910

Lys Ile Ser Asn Asp Cys Glu Leu Pro Arg Trp His Met His Asp Phe
915 920 925

Phe His Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp Ile
930 935 940

Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gln Thr Met Cys Leu
945 950 955 960

Thr Val Phe Met Met Val Met Val Ile Gly Asn Leu Val Val Leu Asn
965 970 975
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Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ser Asp Asn Leu Ala 

980 985 990 

Ala Thr Asp Asp Asp Asn Glu Met Asn Asn Ile Gln Ile Ala Val Gly
995 1000 1005

Arg Met Gln Lys Gly Ile Asp Phe Val Lys Arg Lys Ile Arg Glu
1010 1015 1020

Phe Ile Gln Lys Ala Phe Val Arg Lys Gln Lys Ala Leu Asp Glu
1025 1030 1035

Ile Lys Pro Leu Glu Asp Leu Asn Asn Lys Lys Asp Ser Cys Ile
1040 1045 1050

Ser Asn His Thr Thr Ile Glu Ile Gly Lys Asp Leu Asn Tyr Leu
1055 1060 1065

Lys Asp Gly Asn Gly Thr Thr Ser Gly Ile Gly Ser Ser Val Glu
1070 1075 1080

Lys Tyr Val Val Asp Glu Ser Asp Tyr Met Ser Phe Ile Asn Asn
1085 1090 1095

Pro Ser Leu Thr Val Thr Val Pro Ile Ala Val Gly Glu Ser Asp
1100 1105 1110

Phe Glu Asn Leu Asn Thr Glu Glu Phe Ser Ser Glu Ser Asp Met 
1115 1120 1125 

Glu Glu Ser Lys Glu Lys Leu Asn Ala Thr Ser Ser Ser Glu Gly 
1130 1135 1140 

Ser Thr Val Asp Ile Gly Ala Pro Ala Glu Gly Glu Gln Pro Glu
1145 1150 1155

Val Glu Pro Glu Glu Ser Leu Glu Pro Glu Ala Cys Phe Thr Glu
1160 1165 1170

Asp Cys Val Arg Lys Phe Lys Cys Cys Gln Ile Ser Ile Glu Glu
1175 1180 1185

Gly Lys Gly Lys Leu Trp Trp Asn Leu Arg Lys Thr Cys Tyr Lys 
1190 1195 1200 

Ile Val Glu His Asn Trp Phe Glu Thr Phe Ile Val Phe Met Ile
1205 1210 1215

Leu Leu Ser Ser Gly Ala Leu Ala Phe Glu Asp Ile Tyr Ile Glu
1220 1225 1230

Gln Arg Lys Thr Ile Lys Thr Met Leu Glu Tyr Ala Asp Lys Val
1235 1240 1245

Phe Thr Tyr Ile Phe Ile Leu Glu Met Leu Leu Lys Trp Val Ala
1250 1255 1260

Tyr Gly Phe Gln Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp
1265 1270 1275

Phe Leu Ile Val Asp Val Ser Leu Val Ser Leu Thr Ala Asn Ala
1280 1285 1290

Leu Gly Tyr Ser Glu Leu Gly Ala Ile Lys Ser Leu Arg Thr Leu
1295 1300 1305

Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Arg Phe Glu Gly Met 
1310 1315 1320 

Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile Pro Ser Ile Met
1325 1330 1335

Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile Phe Ser Ile
1340 1345 1350

Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys Ile Asn
1355 1360 1365

Tyr Thr Thr Gly Glu Met Phe Asp Val Ser Val Val Asn Asn Tyr
1370 1375 1380

Ser Glu Cys Lys Ala Leu Ile Glu Ser Asn Gln Thr Ala Arg Trp
1385 1390 1395

Lys Asn Val Lys Val Asn Phe Asp Asn Val Gly Leu Gly Tyr Leu 
1400 1405 1410 

Ser Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met
1415 1420 1425

Tyr Ala Ala Val Asp Ser Arg Asn Val Glu Leu Gln Pro Lys Tyr
1430 1435 1440

Glu Asp Asn Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile
1445 1450 1455

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile
1460 1465 1470

Asp Asn Phe Asn Gln Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile
1475 1480 1485

Phe Met Thr Glu Glu Gln Lys Lys Tyr Tyr Asn Ala Met Lys Lys
1490 1495 1500

Leu Gly Ser Lys Lys Pro Gln Lys Pro Ile Pro Arg Pro Ala Asn
1505 1510 1515

Lys Phe Gln Gly Met Val Phe Asp Phe Val Thr Lys Gln Val Phe
1520 1525 1530

Asp Ile Ser Ile Met Ile Leu Ile Cys Leu Asn Met Val Thr Met
1535 1540 1545

Met Val Glu Thr Asp Asp Gln Ser Gln Glu Met Thr Asn Ile Leu
1550 1555 1560

Tyr Trp Ile Asn Leu Val Phe Ile Val Leu Phe Thr Gly Glu Cys
1565 1570 1575



Val Leu Lys Leu Ile Ser Leu Arg Tyr Tyr Tyr Phe Thr Ile Gly
1580 1585 1590

Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile Val Gly
1595 1600 1605

Met Phe Leu Ala Glu Leu Ile Glu Lys Tyr Phe Val Ser Pro Thr
1610 1615 1620

Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile Leu Arg
1625 1630 1635

Leu Ile Lys Gly Ala Lys Gly Ile Arg Thr Leu Leu Phe Ala Leu
1640 1645 1650

Met Met Ser Leu Pro Ala Leu Phe Asn Ile Gly Leu Leu Leu Phe
1655 1660 1665

Leu Val Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala
1670 1675 1680

Tyr Val Lys Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu
1685 1690 1695

Thr Phe Gly Asn Ser Met Ile Cys Leu Phe Gln Ile Thr Thr Ser
1700 1705 1710

Ala Gly Trp Asp Gly Leu Leu Ala Pro Ile Leu Asn Ser Gly Pro
1715 1720 1725

Pro Asp Cys Asp Pro Asp Lys Asp His Pro Gly Ser Ser Val Lys
1730 1735 1740

Gly Asp Cys Gly Asn Pro Ser Val Gly Ile Phe Phe Phe Val Ser
1745 1750 1755

Tyr Ile Ile Ile Ser Phe Leu Val Val Leu Asn Met Tyr Ile Ala
1760 1765 1770

Val Ile Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala Glu
1775 1780 1785

Pro Leu Ser Glu Asp Asp Phe Glu Met Phe Tyr Glu Val Trp Glu
1790 1795 1800

Lys Phe Asp Pro Asp Ala Thr Gln Phe Ile Glu Phe Ala Lys Leu
1805 1810 1815

Ser Asp Phe Ala Asp Ala Leu Asp Pro Pro Leu Leu Ile Ala Lys
1820 1825 1830

Pro Asn Lys Val Gln Leu Ile Ala Met Asp Leu Pro Met Val Ser
1835 1840 1845

Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr Lys
1850 1855 1860

Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg Ile Gln
1865 1870 1875

Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr 
1880 1885 1890 

Glu Pro Ile Thr Thr Thr Leu Lys Arg Lys Gln Glu Glu Val Ser
1895 1900 1905

Ala Ile Ile Ile Gln Arg Ala Tyr Arg Arg Tyr Leu Leu Lys Gln
1910 1915 1920

Lys Val Lys Lys Val Ser Ser Ile Tyr Lys Lys Asp Lys Gly Lys
1925 1930 1935

Glu Cys Asp Gly Thr Pro Ile Lys Glu Asp Thr Leu Ile Asp Lys
1940 1945 1950

Leu Asn Glu Asn Ser Thr Pro Glu Lys Thr Asp Met Thr Pro Ser 
1955 1960 1965 

Thr Thr Ser Pro Pro Ser Tyr Asp Ser Val Thr Lys Pro Glu Lys 
1970 1975 1980 

Glu Lys Phe Glu Lys Asp Lys Ser Glu Lys Glu Asp Lys Gly Lys 
1985 1990 1995 



Asp Ile Arg Glu Ser Lys Lys
2000 2005



<210> 67 

<211> 2005 

<212> PRT 

<213> Homo sapiens 

<400> 67 

Met Ala Gln Ser Val Leu Val Pro Pro Gly Pro Asp Ser Phe Arg Phe
1 5 10 15

Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Gln Arg Ile Ala Glu Glu
20 25 30

Lys Ala Lys Arg Pro Lys Gln Glu Arg Lys Asp Glu Asp Asp Glu Asn
35 40 45

Gly Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Ser Leu Pro Phe 
50 55 60 

Ile Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Val Pro Leu Glu Asp
65 70 75 80

Leu Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys
85 90 95

Gly Lys Ala Ile Ser Arg Phe Ser Ala Thr Pro Ala Leu Tyr Ile Leu
100 105 110

Thr Pro Phe Asn Pro Ile Arg Lys Leu Ala Ile Lys Ile Leu Val His
115 120 125

Ser Leu Phe Asn Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val
130 135 140

Phe Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr
145 150 155 160

Thr Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Leu Ala
165 170 175

Arg Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn 

180 185 190 



Trp Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val
195 200 205

Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala
210 215 220

Leu Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala
225 230 235 240

Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val
245 250 255

Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly
260 265 270

Asn Leu Arg Asn Lys Cys Leu Gln Trp Pro Pro Asp Asn Ser Ser Phe
275 280 285

Glu Ile Asn Ile Thr Ser Phe Phe Asn Asn Ser Leu Asp Gly Asn Gly
290 295 300

Thr Thr Phe Asn Arg Thr Val Ser Ile Phe Asn Trp Asp Glu Tyr Ile
305 310 315 320

Glu Asp Lys Ser His Phe Tyr Phe Leu Glu Gly Gln Asn Asp Ala Leu
325 330 335

Leu Cys Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Ile
340 345 350

Cys Val Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp
355 360 365

Thr Phe Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp
370 375 380

Phe Trp Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr
385 390 395 400

Tyr Met Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu
405 410 415

Ile Asn Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn
420 425 430

Gln Ala Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln
435 440 445

Met Leu Glu Gln Leu Lys Lys Gln Gln Glu Glu Ala Gln Ala Ala Ala
450 455 460

Ala Ala Ala Ser Ala Glu Ser Arg Asp Phe Ser Gly Ala Gly Gly Ile
465 470 475 480

Gly Val Phe Ser Glu Ser Ser Ser Val Ala Ser Lys Leu Ser Ser Lys 

485 490 495 

Ser Glu Lys Glu Leu Lys Asn Arg Arg Lys Lys Lys Lys Gln Lys Glu
500 505 510

Gln Ser Gly Glu Glu Glu Lys Asn Asp Arg Val Leu Lys Ser Glu Ser
515 520 525

Glu Asp Ser Ile Arg Arg Lys Gly Phe Arg Phe Ser Leu Glu Gly Ser
530 535 540

Arg Leu Thr Tyr Glu Lys Arg Phe Ser Ser Pro His Gln Ser Leu Leu
545 550 555 560

Ser Ile Arg Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Ala Ser
565 570 575

Leu Phe Ser Phe Arg Gly Arg Ala Lys Asp Ile Gly Ser Glu Asn Asp
580 585 590

Phe Ala Asp Asp Glu His Ser Thr Phe Glu Asp Asn Asp Ser Arg Arg
595 600 605

Asp Ser Leu Phe Val Pro His Arg His Gly Glu Arg Arg His Ser Asn
610 615 620

Val Ser Gln Ala Ser Arg Ala Ser Arg Val Leu Pro Ile Leu Pro Met
625 630 635 640

Asn Gly Lys Met His Ser Ala Val Asp Cys Asn Gly Val Val Ser Leu 

645 650 655 

Val Gly Gly Pro Ser Thr Leu Thr Ser Ala Gly Gin Leu Leu Pro Glu 

660 665 670 

Gly Thr Thr Thr Glu Thr Glu lie Arg Lys Arg Arg Ser Ser Ser Tyr 
675 680 685 

His Val Ser Met Asp Leu Leu Glu Asp Pro Thr Ser Arg Gin Arg Ala 
690 695 700 



Met Ser lie Ala Ser lie Leu Thr Asn Thr Met Glu Glu Leu Glu Glu 
705 710 715 720 



Ser Arg Gin Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala Asn Met. Cys 

725 730 735 

Leu He Trp Asp Cys Cys Lys Pro Trp Leu Lys Val Lys His Leu Val 

740 745 750 

Asn Leu Val Val Met Asp Pro Phe Val Asp Leu Ala He Thr He Cys 
755 760 765 

He Val Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr 
770 775 780 

Glu Gln Phe Ser Ser Val Leu Ser Val Gly Asn Leu Val Phe Thr Gly
785 790 795 800 



He Phe Thr Ala Glu Met Phe Leu Lys He He Ala Met Asp Pro Tyr 

805 810 815 

Tyr Tyr Phe Gin Glu Gly Trp Asn He Phe Asp Gly Phe He Val Ser 

820 825 830 

Leu Ser Leu Met Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val
835 840 845

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp
850 855 860 

Pro Thr Leu Asn Met Leu lie Lys He He Gly Asn Ser Val Gly Ala 
865 870 875 880 

Leu Gly Asn Leu Thr Leu Val Leu Ala He He Val Phe He Phe Ala 

885 890 895 

Val Val Gly Met Gin Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys 

900 905 910 

Lys He Ser Asn Asp Cys Glu Leu Pro Arg Trp His Met His Asp Phe 
915 920 925 

Phe His Ser Phe Leu He Val Phe Arg Val Leu Cys Gly Glu Trp He 
930 935 940 

Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gin Thr Met Cys Leu 
945 950 955 960 

Thr Val Phe Met Met Val Met Val He Gly Asn Leu Val Val Leu Asn 

965 970 975 

Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ser Asp Asn Leu Ala 

980 985 990 

Ala Thr Asp Asp Asp Asn Glu Met Asn Asn Leu Gin lie Ala Val Gly 
995 1000 1005 

Arg Met Gin Lys Gly He Asp Phe Val Lys Arg Lys He Arg Glu 
1010 1015 1020 

Phe He Gin Lys Ala Phe Val Arg Lys Gin Lys Ala Leu Asp Glu 
1025 1030 1035 

He Lys Pro Leu Glu Asp Leu Asn Asn Lys Lys Asp Ser Cys He 
1040 1045 1050 



Ser Asn His Thr Thr Ile Glu Ile Gly Lys Asp Leu Asn Tyr Leu
1055 1060 1065

Lys Asp Gly Asn Gly Thr Thr Ser Gly lie Gly Ser Ser Val Glu 
1070 1075 1080 

Lys Tyr Val Val Asp Glu Ser Asp Tyr Met Ser Phe lie Asn Asn 
1085 1090 1095 

Pro Ser Leu Thr Val Thr Val Pro He Ala Val Gly Glu Ser Asp 
1100 1105 1110 

Phe Glu Asn Leu Asn Thr Glu Glu Phe Ser Ser Glu Ser Asp Met 
1115 1120 1125 

Glu Glu Ser Lys Glu Lys Leu Asn Ala Thr Ser Ser Ser Glu Gly 
1130 1135 1140 

Ser Thr Val Asp He Gly Ala Pro Ala Glu Gly Glu Gin Pro Glu 
1145 1150 1155 

Val Glu Pro Glu Glu Ser Leu Glu Pro Glu Ala Cys Phe Thr Glu 
1160 1165 1170 

Asp Cys Val Arg Lys Phe Lys Cys Cys Gin He Ser lie Glu Glu 
1175 1180 1185 

Gly Lys Gly Lys Leu Trp Trp Asn Leu Arg Lys Ala Cys Tyr Lys 
1190 1195 1200 

He Val Glu His Asn Trp Phe Glu Thr Phe He Val Phe Met He 
1205 1210 1215 



Leu Leu Ser Ser Gly Ala Leu Ala Phe Glu Asp He Tyr He Glu 
1220 1225 1230 



Gin Arg Lys Thr He Lys Thr Met Leu Glu Tyr Ala Asp Lys Val 
1235 1240 1245 



Phe Thr Tyr Ile Phe Ile Leu Glu Met Leu Leu Lys Trp Val Ala
1250 1255 1260

Tyr Gly Phe Gln Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp
1265 1270 1275 



Phe Leu Ile Val Asp Val Ser Leu Val Ser Leu Thr Ala Asn Ala
1280 1285 1290

Leu Gly Tyr Ser Glu Leu Gly Ala Ile Lys Ser Leu Arg Thr Leu
1295 1300 1305

Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Arg Phe Glu Gly Met
1310 1315 1320

Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile Pro Ser Ile Met
1325 1330 1335

Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile Phe Ser Ile
1340 1345 1350

Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys Ile Asn
1355 1360 1365

Tyr Thr Thr Gly Glu Met Phe Asp Val Ser Val Val Asn Asn Tyr
1370 1375 1380

Ser Glu Cys Lys Ala Leu Ile Glu Ser Asn Gln Thr Ala Arg Trp
1385 1390 1395

Lys Asn Val Lys Val Asn Phe Asp Asn Val Gly Leu Gly Tyr Leu
1400 1405 1410

Ser Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met
1415 1420 1425

Tyr Ala Ala Val Asp Ser Arg Asn Val Glu Leu Gln Pro Lys Tyr
1430 1435 1440

Glu Asp Asn Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile
1445 1450 1455

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile
1460 1465 1470

Asp Asn Phe Asn Gln Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile
1475 1480 1485 



Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys 

1490 1495 1500 

Leu Gly Ser Lys Lys Pro Gin Lys Pro lie Pro Arg Pro Ala Asn 

1505 1510 1515 

Lys Phe Gin Gly Met Val Phe Asp Phe Val Thr Lys Gin Val Phe 

1520 1525 1530 

Asp He Ser He Met He Leu He Cys Leu Asn Met Val Thr Met 

1535 1540 1545 

Met Val Glu Thr Asp Asp Gln Ser Gln Glu Met Thr Asn Ile Leu

1550 1555 1560 

Tyr Trp He Asn Leu Val Phe He Val Leu Phe Thr Gly Glu Cys 

1565 1570 1575 

Val Leu Lys Leu He Ser Leu Arg Tyr Tyr Tyr Phe Thr He Gly 

1580 1585 1590 

Trp Asn He Phe Asp Phe Val Val Val He Leu Ser He Val Gly 

1595 1600 1605 

Met Phe Leu Ala Glu Leu He Glu Lys Tyr Phe Val Ser Pro Thr 

1610 1615 1620 

Leu Phe Arg Val He Arg Leu Ala Arg He Gly Arg lie Leu Arg 

1625 1630 1635 

Leu He Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala Leu 

1640 1645 1650 

Met Met Ser Leu Pro Ala Leu Phe Asn He Gly Leu Leu Leu Phe 

1655 1660 1665 

Leu Val Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala

1670 1675 1680

Tyr Y« c Lys Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu
1685 1690 1695

Thr F™« Gly Asn Ser Met Ile °y s Leu Phe Gln Ile Thr Thr Ser
1700 1705 1710

Ala Gly Trp Asp Gly Leu Leu Ala Pro Ile Leu Asn Ser Gly Pro
1715 1720 1725

Pro A ??„ Cys Asp Pro Asp Lys Asp His Pro Gly Ser Ser Val Lys
1730 1735 1740

Gly Asp Cys Gly Asn Pro Ser Val Gly Ile Phe Phe Phe Val Ser 
1745 1750 1755 

Tyr Ile Ile Ile Ser Phe Leu Val Val Leu Asn Met Tyr Ile Ala 
1760 1765 1770 

Val Ile Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala Glu 
1775 1780 1785 

Pro Leu Ser Glu Asp Asp Phe Glu Met Phe Tyr Glu Val Trp Glu 
1790 1795 1800 

Lys Phe Asp Pro Asp Ala Thr Gin Phe Ile Glu Phe Ala Lys Leu 
1805 1810 1815 

Ser Asp Phe Ala Asp Ala Leu Asp Pro Pro Leu Leu Ile Ala Lys
1820 1825 1830 

Pro Asn Lys Val Gin Leu Ile Ala Met Asp Leu Pro Met Val Ser 
1835 1840 1845 

Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr Lys 
1850 1855 1860

Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg Ile Gin 
1865 1870 1875



Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr
1880 1885 1890

Glu ?J£ K Ile Thr Thr Thr Leu Arg Gln Glu Glu Val Ser

1895 1900 1905

Ala ^i?n Ile Ile Gln Arg Ala Tyr Arg Arg ^Vr Leu Leu Lys Gln
1910 1915 1920

Lys Yto* Lys Lys Val Ser Ser Ile ^ Asp Lys Gly Lys

1930 1935

Glu ?Kn Asp Gly Thr Pro Ile Lys Glu Asp Thr Leu Ile Asp Lys
1940 1945 1950

Leu ^S? c Glu Asn Ser Thr Pro Glu Lys Thr Asp Thr Pro Ser

1955 1960 1965

Thr Tq™ Pro Pro Ser Tyr Asp Ser Val Thr Lys Pro Glu Lys

1970 1975 1980

Glu Phe Glu Lys Asp Lys Ser Glu Lys Glu Asp Lys Gly Lys

1985 1990 1995

Asp Ile Arg Glu Ser Lys Lys
2000 2005 



<210> 68 

<211> 2005 

<212> PRT 

<213> Homo sapiens 

<400> 68 

Met Ala Gln Ser Val Leu Val Pro Pro Gly Pro Asp Ser Phe Arg Phe
1 5 10 15

Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Gln Arg Ile Ala Glu Glu

20 25 30 

Lys Ala Lys Arg Pro Lys Gin Glu Arg Lys Asp Glu Asp Asp Glu Asn 
35 40 45 



Gly Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Ser Leu Pro Phe
50 55 60

Ile Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Val Pro Leu Glu Asp
65 70 75 80 

Leu Asp Pro Tyr Tyr He Asn Lys Lys Thr Phe He Val Leu Asn Lys 

85 90 95 

Gly Lys Ala He Ser Arg Phe Ser Ala Thr Pro Ala Leu Tyr He Leu 

100 105 110 

Thr Pro Phe Asn Pro He Arg Lys Leu Ala He Lys He Leu Val His 
115 120 125 

Ser Leu Phe Asn Met Leu He Met Cys Thr He Leu Thr Asn Cys Val 
130 135 140 

Phe Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr 
145 150 155 160 



Thr Phe Thr Gly He Tyr Thr Phe Glu Ser Leu He Lys He Leu Ala 

165 170 175 



Arg Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn 

180 185 190 

Trp Leu Asp Phe Thr Val He Thr Phe Ala Tyr Val Thr Glu Phe Val 
195 200 205 

Asp Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala 
210 215 220 



Leu Lys Thr He Ser Val He Pro Gly Leu Lys Thr He Val Gly Ala 
225 230 235 240 

Leu Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val

245 250 255 

Phe Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly

260 265 270

Asn Leu Arg Asn Lys Cys Leu Gln Trp Pro Pro Asp Asn Ser Ser Phe
275 280 285

Glu Ile Asn Ile Thr Ser Phe Phe Asn Asn Ser Leu Asp Gly Asn Gly
290 295 300

Thr Thr Phe Asn Arg Thr Val Ser Ile Phe Asn Trp Asp Glu Tyr Ile
305 310 315 320

Glu Asp Lys Ser His Phe Tyr Phe Leu Glu Gly Gin Asn Asp Ala Leu 

325 330 335 

Leu Cys Gly Asn Ser Ser Asp Ala Gly Gin Cys Pro Glu Gly Tyr He 

340 345 350 

Cys Val Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp 
355 360 365 

Thr Phe Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gin Asp 
370 375 380 

Phe Trp Glu Asn Leu Tyr Gin Leu Thr Leu Arg Ala Ala Gly Lys Thr 
385 390 395 400 

Tyr Met He Phe Phe Val Leu Val He Phe Leu Gly Ser Phe Tyr Leu 

405 410 415 

lie Asn Leu He Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gin Asn 

420 425 430 

Gin Ala Thr Leu Glu Glu Ala Glu Gin Lys Glu Ala Glu Phe Gin Gin 
435 440 445 

Met Leu Glu Gin Leu Lys Lys Gin Gin Glu Glu Ala Gin Ala Ala Ala 
450 455 460 

Ala Ala Ala Ser Ala Glu Ser Arg Asp Phe Ser Gly Ala Gly Gly He 
465 470 475 480 

Gly Val Phe Ser Glu Ser Ser Ser Val Ala Ser Lys Leu Ser Ser Lys

485 490 495

Ser Glu Lys Glu Leu Lys Asn Arg Arg Lys Lys Lys Lys Gln Lys Glu

500 505 510

Gin Ser Gly Glu Glu Glu Lys Asn Asp Arg Val Leu Lys Ser Glu Ser 
515 520 525 

Glu Asp Ser He Arg Arg Lys Gly Phe Arg Phe Ser Leu Glu Gly Ser 
530 535 540 

Arg Leu Thr Tyr Glu Lys Arg Phe Ser Ser Pro His Gin Ser Leu Leu 
545 550 555 560 

Ser He Arg Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Ala Ser 

565 570 575 

Leu Phe Ser Phe Arg Gly Arg Ala Lys Asp He Gly Ser Glu Asn Asp 

580 585 590 

Phe Ala Asp Asp Glu His Ser Thr Phe Glu Asp Asn Asp Ser Arg Arg 
595 600 605 

Asp Ser Leu Phe Val Pro His Arg His Gly Glu Arg Arg His Ser Asn 
610 615 620 

Val Ser Gin Ala Ser Arg Ala Ser Arg Val Leu Pro He Leu Pro Met 
625 630 635 640 

Asn Gly Lys Met His Ser Ala Val Asp Cys Asn Gly Val Val Ser Leu 

645 650 655 

Val Gly Gly Pro Ser Thr Leu Thr Ser Ala Gly Gin Leu Leu Pro Glu 

660 665 670 

Gly Thr Thr Thr Glu Thr Glu He Arg Lys Arg Arg Ser Ser Ser Tyr 
675 680 685 

His Val Ser Met Asp Leu Leu Glu Asp Pro Thr Ser Arg Gin Arg Ala 
690 695 700 

Met Ser Ile Ala Ser Ile Leu Thr Asn Thr Met Glu Glu Leu Glu Glu
705 710 715 720

Ser Arg Gln Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ala Asn Met Cys

725 730 735

Leu Ile Trp Asp Cys Cys Lys Pro Trp Leu Lys Val Lys His Leu Val

740 745 750

Asn Leu Val Val Met Asp Pro Phe Val Asp Leu Ala Ile Thr Ile Cys
755 760 765

He Val Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr 
770 775 780 

Glu Gin Phe Ser Ser Val Leu Ser Val Gly Asn Leu Val Phe Thr Gly 
785 790 795 800

lie Phe Thr Ala Glu Met Phe Leu Lys He He Ala Met Asp Pro Tyr 

805 810 815 

Tyr Tyr Phe Gin Glu Gly Trp Asn He Phe Asp Gly Phe He Val Ser 

820 825 830 

Leu Ser Leu Met Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val 
835 840 845 

Leu Arg Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp 
850 855 860 

Pro Thr Leu Asn Met Leu He Lys He He Gly Asn Ser Val Gly Ala 
865 870 875 880 

Leu Gly Asn Leu Thr Leu Val Leu Ala He He Val Phe He Phe Ala 

885 890 895 



Val Val Gly Met Gin Leu Phe Gly Lys Ser Tyr Lys Glu Cys Val Cys 

900 905 910



Lys He Ser Asn Asp Cys Glu Leu Pro Arg Trp His Met His Asp Phe 
915 920 925 

Phe His Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp Ile
930 935 940

Glu Thr Met Trp Asp Cys Met Glu Val Ala Gly Gin Thr Met Cys Leu 
945 950 955 960 

Thr Val Phe Met Met Val Met Val lie Gly Asn Leu Val Val Leu Asn 

965 970 975 

Leu Phe Leu Ala Leu Leu Leu Ser Ser Phe Ser Ser Asp Asn Leu Ala 

980 985 990 

Ala Thr Asp Asp Asp Asn Glu Met Asn Asn Leu Gin lie Ala Val Gly 
995 1000 1005 

Arg Met Gin Lys Gly He Asp Phe Val Lys Arg Lys He Arg Glu 
1010 1015 1020 

Phe He Gin Lys Ala Phe Val Arg Lys Gin Lys Ala Leu Asp Glu 
1025 1030 1035

He Lys Pro Leu Glu Asp Leu Asn Asn Lys Lys Asp Ser Cys He 
1040 1045 1050 

Ser Asn His Thr Thr He Glu He Gly Lys Asp Leu Asn Tyr Leu 
1055 1060 1065 

Lys Asp Gly Asn Gly Thr Thr Ser Gly He Gly Ser Ser Val Glu 
1070 1075 1080

Lys Tyr Val Val Asp Glu Ser Asp Tyr Met Ser Phe He Asn Asn 
1085 1090 1095 

Pro Ser Leu Thr Val Thr Val Pro He Ala Val Gly Glu Ser Asp 
1100 1105 1110 

Phe Glu Asn Leu Asn Thr Glu Glu Phe Ser Ser Glu Ser Asp Met 
1115 1120 1125 

Glu Glu Ser Lys Glu Lys Leu Asn Ala Thr Ser Ser Ser Glu Gly
1130 1135 1140

TSr Asp Ile Gly Ala Pro Ala Glu Gly Glu Gln Pro Glu

1150 1155

Val ?Sn Pro Glu Glu Ser Leu Glvi Pro Glu Ala Phe Thr Glu

1160 1165 1170

Asp Val Arg Lys Phe Lys Cys Cys Gln Ile Ser Ile Glu Glu

1175 1180 1185

Gly ilon Gly Lys Leu Trp Trp Asn Leu Ar S Thr Cys Tyr Lys

1190 1195 1200

Ile Val Glu His Asn Trp Phe Glu Thr Phe Ile Val Phe Met Ile
1205 1210 1215

non Gly Ala Leu Ala Phe Glu Asp Ile ^ Ile Glu

1225 1230



Gln Arg Lys Thr Ile Lys Thr Met Leu Glu Tyr Ala Asp Lys Val
1235 1240 1245

Phe Thr Tyr Ile Phe Ile Leu Glu Met Leu Leu Lys Trp Val Ala
1250 1255 1260

Tyr Gly Phe Gln Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp
1265 1270 1275

Phe Leu Ile Val Asp Val Ser Leu Val Ser Leu Thr Ala Asn Ala
1280 1285 1290

Leu Gly Tyr Ser Glu Leu Gly Ala Ile Lys Ser Leu Arg Thr Leu
1295 1300 1305

Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Gln Phe Glu Gly Met
1310 1315 1320

Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile Pro Ser Ile Met
1325 1330 1335

Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile Phe Ser Ile
1340 1345 1350

Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys Ile Asn
1355 1360 1365

Tyr Thr Thr Gly Glu Met Phe Asp Val Ser Val Val Asn Asn Tyr
1370 1375 1380

Ser Glu Cys Lys Ala Leu Ile Glu Ser Asn Gln Thr Ala Arg Trp

1385 1390 1395

??nn Lys Phe Asp Asn Val Gly Leu Gly Leu

1405 1410

Ser Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met
1415 1420 1425

Tyr Ala Val Asp Ser Asn Val Glu Leu Gln Pro Lys Tyr

1430 1435 1440

Glu Asp Asn Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile
1445 1450 1455

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile
1460 1465 1470

Asp Asn Phe Asn Gln Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile
1475 1480 1485

Phe Met Thr Glu Glu Gln Lys Lys Tyr Tyr Asn Ala Met Lys Lys
1490 1495 1500

Leu Ser Pro Gl * Lys Pro Ile Pro Arg Pro Ala Asn

1505 1510 1515

Lys Phe Gln Gly Met Val Phe Asp Phe Val Thr Lys Gln Val Phe
1520 1525 1530

Asp Ile Ser Ile Met Ile Leu Ile Cys Leu Asn Met Val Thr Met
1535 1540 1545

Met Val Glu Thr Asp Asp Gln Ser Gln Glu Met Thr Asn Ile Leu
1550 1555 1560

Tyr Trp Ile Asn Leu Val Phe Ile Val Leu Phe Thr Gly Glu Cys
1565 1570 1575

Val Lys Leu Ile Ser Leu Arg Tyr Tyr Tyr Phe Thr Ile Gly

1580 1585 1590

Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile Val Gly
1595 1600 1605

Met Phe Leu Ala Glu Leu Ile Glu Lys Tyr Phe Val Ser Pro Thr
1610 1615 1620

Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile Leu Arg
1625 1630 1635

Leu Ile Lys Gly Ala Lys Gly Ile Arg Thr Leu Leu Phe Ala Leu
1640 1645 1650

Met Met Ser Leu Pro Ala Leu Phe Asn Ile Gly Leu Leu Leu Phe 
1655 1660 1665 

Leu Val Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala 
1670 1675 1680 

Tyr Val Lys Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu 
1685 1690 1695 

Thr Phe Gly Asn Ser Met Ile Cys Leu Phe Gin Ile Thr Thr Ser 
1700 1705 1710 

Ala Gly Trp Asp Gly Leu Leu Ala Pro Ile Leu Asn Ser Gly Pro 
1715 1720 1725 

Pro Asp Cys Asp Pro Asp Lys Asp His Pro Gly Ser Ser Val Lys 
1730 1735 1740 

Gly Asp Cys Gly Asn Pro Ser Val Gly Ile Phe Phe Phe Val Ser
1745 1750 1755



Tyr Ile Ile Ile Ser Phe Leu Val Val Leu Asn Met Tyr Ile Ala
1760 1765 1770

Val Ile Leu Glu Asn Phe Ser Val Ala Thr Glu Glu Ser Ala Glu
1775 1780 1785

Pro Leu Ser Glu Asp Asp Phe Glu Met Phe Tyr Glu Val Trp Glu
1790 1795 1800

Lys Phe Asp Pro Asp Ala Thr Gin Phe He Glu Phe Ala Lys Leu 
1805 1810 1815 

Ser Asp Phe Ala Asp Ala Leu Asp Pro Pro Leu Leu He Ala Lys 
1820 1825 1830 

Pro Asn Lys Val Gin Leu He Ala Met Asp Leu Pro Met Val Ser 
1835 1840 1845 

Gly Asp Arg He His Cys Leu Asp He Leu Phe Ala Phe Thr Lys 
1850 1855 1860

Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg He Gin 
1865 1870 1875 

Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr 
1880 1885 1890 

Glu Pro He Thr Thr Thr Leu Lys Arg Lys Gin Glu Glu Val Ser 
1895 1900 1905 

Ala He He He Gin Arg Ala Tyr Arg Arg Tyr Leu Leu Lys Gin 
1910 1915 1920 

Lys Val Lys Lys Val Ser Ser He Tyr Lys Lys Asp Lys Gly Lys 
1925 1930 1935 

Glu Cys Asp Gly Thr Pro He Lys Glu Asp Thr Leu lie Asp Lys 
1940 1945 1950 

Leu Asn Glu Asn Ser Thr Pro Glu Lys Thr Asp Met Thr Pro Ser
1955 1960 1965

Thr Thr Ser Pro Pro Ser Tyr Asp Ser Val Thr Lys Pro Glu Lys
1970 1975 1980 

Glu Lys Phe Glu Lys Asp Lys Ser Glu Lys Glu Asp Lys Gly Lys 
1985 1990 1995 

Asp Ile Arg Glu Ser Lys Lys
2000 2005 

<210> 69 

<211> 468 

<212> PRT 

<213> Homo sapiens 

<400> 69 

Met Ala Ala Arg Gly Ser Gly Pro Arg Ala Leu Arg Leu Leu Leu Leu 
1 5 10 15

Val Gin Leu Val Ala Gly Ala Leu Arg Ser Ser Arg Ala Arg Arg Ala 

20 25 30 

Ala Arg Arg Gly Leu Ser Glu Pro Ser Ser lie Ala Lys His Glu Asp 
35 40 45 

Ser Leu Leu Lys Asp Leu Phe Gin Asp Tyr Glu Arg Trp Val Arg Pro 
50 55 60 

Val Glu His Leu Asn Asp Lys lie Lys lie Lys Phe Gly Leu Ala lie 
65 70 75 80 

Ser Gin Leu Val Asp Val Asp Glu Lys Asn Gin Leu Met Thr Thr Asn 

85 90 95 

Val Trp Leu Lys Gin Glu Trp lie Asp Val Lys Leu Arg Trp Asn Pro 

100 105 110 

Asp Asp Tyr Gly Gly He Lys Val He Arg Val Pro Ser Asp Ser Ser 
115 120 125 

Trp Thr Pro Asp Ile Ile Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu
130 135 140

Gly Thr Ser Thr Lys Thr Val Ile Arg Tyr Asn Gly Thr Val Thr Trp
145 150 155 160 



Thr Pro Pro Ala Asn Tyr Lys Ser Ser Cys Thr lie Asp Val Thr Phe 

165 170 175 

Phe Pro Phe Asp Leu Gin Asn Cys Ser Met Lys Phe Gly Ser Trp Thr 

180 185 190

Tyr Asp Gly Ser Gin Val Asp He He Leu Glu Asp Gin Asp Val Asp 
195 200 205 

Lys Arg Asp Phe Phe Asp Asn Gly Glu Trp Glu He Val Ser Ala Thr 
210 215 220 

Gly Ser Lys Gly Asn Arg Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val 
225 230 235 240

Thr Tyr Ser Phe Val He Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe 

245 250 255 

Leu He He Pro Cys He Gly Leu Ser Phe Leu Thr Val Leu Val Phe 

260 265 270 

Tyr Leu Pro Ser Asn Glu Gly Glu Lys He Cys Leu Cys Thr Ser Val 
275 280 285 



Leu Val Ser Leu Thr Val Phe Leu Leu Val Ile Glu Glu Ile Ile Pro

290 295 300

Ser Ser Ser Lys Val lie Pro Leu He Gly Glu Tyr Leu Val Phe Thr 

305 310 315 320 



Met He Phe Val Thr Leu Ser He Met Val Thr Val Phe Ala He Asn 

325 330 335 

He His His Arg Ser Ser Ser Thr His Asn Ala Met Ala Pro Leu Val 

340 345 350 

Arg Lys He Phe Leu His Thr Leu Pro Lys Leu Leu Ser Met Arg Ser 
355 360 365 



His Val Asp Arg Tyr Phe Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser 
370 375 380 

Gly Pro Lys Ser Ser Arg Asn Thr Leu Glu Ala Ala Leu Asp Ser He 
385 390 395 400 

Arg Tyr He Thr Thr His He Met Lys Glu Asn Asp Val Arg Glu Val 

405 410 415 

Val Glu Asp Trp Lys Phe He Ala Gin Val Leu Asp Arg Met Phe Leu 

420 425 430 

Trp Thr Phe Leu Phe Val Ser He Val Gly Ser Leu Gly Leu Phe Val 
435 440 445 

Pro Val Ile Tyr Lys Trp Ala Asn Ile Leu Ile Pro Val His Ile Gly
450 455 460 

Asn Ala Asn Lys 
465 



<210> 70 

<211> 529 

<212> PRT 

<213> Homo sapiens 

<400> 70 

Met Gly Pro Ser Cys Pro Val Phe Leu Ser Phe Thr Lys Leu Ser Leu 
1 5 10 15

Trp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu Ala Lys Arg Pro 

20 25 30 

Pro Pro Arg Ala Pro Gly Asp Pro Leu Ser Ser Pro Ser Pro Thr Ala 
35 40 45 

Leu Pro Gin Gly Gly Ser His Thr Glu Thr Glu Asp Arg Leu Phe Lys 
50 55 60 

His Leu Phe Arg Gly Tyr Asn Arg Trp Ala Arg Pro Val Pro Asn Thr
65 70 75 80

Ser Asp Val Val Ile Val Arg Phe Gly Leu Ser Ile Ala Gln Leu Ile

85 90 95 

Asp Val Asp Glu Lys Asn Gin Met Met Thr Thr Asn Val Trp Leu Lys 

100 105 110

Gin Glu Trp Ser Asp Tyr Lys Leu Arg Trp Asn Pro Thr Asp Phe Gly 
115 120 125 

Asn He Thr Ser Leu Arg Val Pro Ser Glu Met He Trp He Pro Asp 
130 135 140 

He Val Leu Tyr Asn Asn Ala Asp Gly Glu Phe Ala Val Thr His Met 
145 150 155 160 

Thr Lys Ala His Leu Phe Ser Thr Gly Thr Val His Trp Val Pro Pro 

165 170 175 

Ala He Tyr Lys Ser Ser Cys Ser He Asp Val Thr Phe Phe Pro Phe 

180 185 190 



Asp Gin Gin Asn Cys Lys Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys 
195 200 205 

Ala Lys He Asp Leu Glu Gin Met Glu Gin Thr Val Asp Leu Lys Asp 
210 215 220 

Tyr Trp Glu Ser Gly Glu Trp Ala He Val Asn Ala Thr Gly Thr Tyr 
225 230 235 240 

Asn Ser Lys Lys Tyr Asp Cys Cys Ala Glu He Tyr Pro Asp Val Thr 

245 250 255 



Tyr Ala Phe Val lie Arg Arg Leu Pro Leu Phe Tyr Thr lie Asn Leu 

260 265 270 

He He Pro Cys Leu Leu He Ser Cys Leu Thr Val Leu Val Phe Tyr 
275 280 285 

Leu Pro Ser Asp Cys Gly Glu Lys Ile Thr Leu Cys Ile Ser Val Leu
290 295 300

Leu Ser Leu Thr Val Phe Leu Leu Leu Ile Thr Glu Xle Ile Pro Ser
305 310 315 320 



Thr Ser Leu Val He Pro Leu He Gly Glu Tyr Leu Leu Phe Thr Met 

325 330 335 



He Phe Val Thr Leu Ser He Val He Thr Val Phe Val Leu Asn Val 

340 345 350 

His His Arg Ser Pro Ser Thr His Thr Met Pro His Trp Val Arg Gly
355 360 365 



Ala Leu Leu Gly Cys Val Pro Arg Trp Leu Leu Met Asn Arg Pro Pro 
370 375 380 

Pro Pro Val Glu Leu Cys His Pro Leu Arg Leu Lys Leu Ser Pro Ser 
385 390 395 400 

Tyr His Trp Leu Glu Ser Asn Val Asp Ala Glu Glu Arg Glu Val Val 

405 410 415 

Val Glu Glu Glu Asp Arg Trp Ala Cys Ala Gly His Val Ala Pro Ser 

420 425 430 

Val Gly Thr Leu Cys Ser His Gly His Leu His Ser Gly Ala Ser Gly 
435 440 445 

Pro Lys Ala Glu Ala Leu Leu Gin Glu Gly Glu Leu Leu Leu Ser Pro 
450 455 460 

His Met Gin Lys Ala Leu Glu Gly Val His Tyr He Ala Asp His Leu 
465 470 475 480 

Arg Ser Glu Asp Ala Asp Ser Ser Val Lys Glu Asp Trp Lys Tyr Val

485 490 495 

Ala Met Val He Asp Arg He Phe Leu Trp Leu Phe He He Val Cys 

500 505 510 



Phe Leu Gly Thr Ile Gly Leu Phe Leu Pro Pro Phe Leu Ala Gly Met
515 520 525

Ile



<210> 71 

<211> 505 

<212> PRT 

<213> Homo sapiens 

<400> 71 

Met Gly Ser Gly Pro Leu Ser Leu Pro Leu Ala Leu Ser Pro Pro Arg 
1 5 10 15

Leu Leu Leu Leu Leu Leu Leu Ser Leu Leu Pro Val Ala Arg Ala Ser 

20 25 30 

Glu Ala Glu His His Leu Phe Glu Arg Leu Phe Glu Asp Tyr Asn Glu 
35 40 45 

He He Arg Pro Val Ala Asn Val Ser Asp Pro Val He He His Phe 
50 55 60 

Glu Val Ser Met Ser Gin Leu Val Lys Val Asp Glu Val Asn Gin He 
65 70 75 80 

Met Glu Thr Asn Leu Trp Leu Lys Gin He Trp Asn Asp Tyr Lys Leu 

85 90 95 

Lys Trp Asn Pro Ser Asp Tyr Gly Gly Ala Glu Phe Met Arg Val Pro 

100 105 110 

Ala Gin Lys He Trp Lys Pro Asp He Val Leu Tyr Asn Asn Ala Val 
115 120 125

Gly Asp Phe Gin Val Asp Asp Lys Thr Lys Ala Leu Leu Lys Tyr Thr 
130 135 140 

Gly Glu Val Thr Trp He Pro Pro Ala He Phe Lys Ser Ser Cys Lys 
145 150 155 160 

Ile Asp Val Thr Tyr Phe Pro Phe Asp Tyr Gln Asn Cys Thr Met Lys
165 170 175

Phe Gly Ser Trp Ser Tyr Asp Lys Ala Lys lie Asp Leu Val Leu He 

180 185 190 

Gly Ser Ser Met Asn Leu Lys Asp Tyr Trp Glu Ser Gly Glu Trp Ala 
195 200 205 

He He Lys Ala Pro Gly Tyr Lys His Asp He Lys Tyr Asn Cys Cys 
210 215 220 

Glu Glu He Tyr Pro Asp He Thr Tyr Ser Leu Tyr He Arg Arg Leu 
225 230 235 240 

Pro Leu Phe Tyr Thr He Asn Leu He He Pro Cys Leu Leu He Ser 

245 250 255

Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys 

260 265 270 

Val Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu 
275 280 285 

Val He Thr Glu Thr lie Pro Ser Thr Ser Leu Val He Pro Leu He 
290 295 300 

Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser He Val 
305 310 315 320 

He Thr Val Phe Val Leu Asn Val His Tyr Arg Thr Pro Thr Thr His 

325 330 335 

Thr Met Pro Ser Trp Val Lys Thr Val Phe Leu Asn Leu Leu Pro Arg 

340 345 350

Val Met Phe Met Thr Arg Pro Thr Ser Asn Glu Gly Asn Ala Gin Lys 
355 360 365 



Pro Arg Pro Leu Tyr Gly Ala Glu Leu Ser Asn Leu Asn Cys Phe Ser
370 375 380

Arg Ala Glu Ser Lys Gly Cys Lys Glu Gly Tyr Pro Cys Gln Asp Gly
385 390 395 400 

Met Cys Gly Tyr Cys His His Arg Arg lie Lys He Ser Asn Phe Ser 

405 410 415 

Ala Asn Leu Thr Arg Ser Ser Ser Ser Glu Ser Val Asp Ala Val Leu 

420 425 430 

Ser Leu Ser Ala Leu Ser Pro Glu He Lys Glu Ala He Gin Ser Val 
435 440 445 

Lys Tyr He Ala Glu Asn Met Lys Ala Gin Asn Glu Ala Lys Glu He 
450 455 460 

Gin Asp Asp Trp Lys Tyr Val Ala Met Val He Asp Arg He Phe Leu 
465 470 475 480

Trp Val Phe Thr Leu Val Cys He Leu Gly Thr Ala Gly Leu Phe Leu 

485 490 495

Gin Pro Leu Met Ala Arg Glu Asp Ala 

500 505 

<210> 72 

<211> 118

<212> PRT 

<213> Homo sapiens 

<400> 72 

Met Val Gin Lys Ser Arg Asn Gly Gly Val Tyr Pro Gly Pro Ser Gly 
1 5 10 15

Glu Lys Lys Leu Lys Val Gly Phe Val Gly Leu Asp Pro Gly Ala Pro 

20 25 30 

Asp Ser Thr Arg Asp Gly Ala Leu Leu He Ala Gly Ser Glu Ala Pro 
35 40 45 

Lys Arg Gly Ser Ile Leu Ser Lys Pro Arg Ala Gly Gly Ala Gly Ala
50 55 60

Gly Lys Pro Pro Gln Ala Gln Arg Leu Leu Pro Gln Ala Ala Glu Phe
65 70 75 80 



Pro Leu Gin Arg Ala Gly Ala Ala Ala Arg Leu Gly Val His Leu Pro 

85 90 95 

Arg Leu Arg Val Pro Pro Gly Phe Leu Leu Pro Arg Ala Val Cys Val 

100 105 110 



Phe His His Gln Gly Val
115

<210> 73

<211> 854

<212> PRT

<213> Homo sapiens

<400> 73

Met Val Gln Lys Ser Arg
1 5 10 15

Glu Lys Lys Leu Lys Val Gly Phe Val Gly Leu Asp Pro Gly Ala Pro 

20 25 30

Asp Ser Thr Arg Asp Gly Ala Leu Leu lie Ala Gly Ser Glu Ala Pro 
35 40 45 



Lys Arg Gly Ser lie Leu Ser Lys Pro Arg Ala Gly Gly Ala Gly Ala 
50 55 60 

Gly Lys Pro Pro Lys Arg Asn Ala Phe Tyr Arg Lys Leu Gin Asn Phe 
65 70 75 80 

Leu Tyr Asn Val Leu Glu Arg Pro Arg Gly Trp Ala Phe lie Tyr His 

85 90 95 

Ala Tyr Val Phe Leu Leu Val Phe Ser Cys Leu Val Leu Ser Val Phe 

100 105 110 

Ser Thr Ile Lys Glu Tyr Glu Lys Ser Ser Glu Gly Ala Leu Tyr Ile
115 120 125

Leu Glu Ile Val Thr Ile Val Val Phe Gly Val Glu Tyr Phe Val Arg
130 135 140

Ile Trp Ala Ala Gly Cys Cys Cys Arg Tyr Arg Gly Trp Arg Gly Arg
145 150 155 160



Leu Lys Phe Ala Arg Lys Pro Phe Cys Val He Asp He Met Val Leu 

165 170 175 

He Ala Ser He Ala Val Leu Ala Ala Gly Ser Gin Gly Asn Val Phe 

180 185 190 

Ala Thr Ser Ala Leu Arg Ser Leu Arg Phe Leu Gin He Leu Arg Met 
195 200 205 

He Arg Met Asp Arg Arg Gly Gly Thr Trp Lys Leu Leu Gly Ser Val 
210 215 220 

Val Tyr Ala His Ser Lys Glu Leu Val Thr Ala Trp Tyr He Gly Phe 
225 230 235 240

Leu Cys Leu He Leu Ala Ser Phe Leu Val Tyr Leu Ala Glu Lys Gly 

245 250 255 

Glu Asn Asp His Phe Asp Thr Tyr Ala Asp Ala Leu Trp Trp Gly Leu 

260 265 270 

He Thr Leu Thr Thr He Gly Tyr Gly Asp Lys Tyr Pro Gin Thr Trp 
275 280 285 

Asn Gly Arg Leu Leu Ala Ala Thr Phe Thr Leu He Gly Val Ser Phe 
290 295 300 

Phe Ala Leu Pro Ala Gly He Leu Gly Ser Gly Phe Ala Leu Lys Val 
305 310 315 320 

Gin Glu Gin His Arg Gin Lys His Phe Glu Lys Arg Arg Asn Pro Ala 

325 330 335 

Ala Gly Leu Ile Gln Ser Ala Trp Arg Phe Tyr Ala Thr Asn Leu Ser

340 345 350

Gly Thr Asp Leu His Ser Thr Trp Gln Tyr Tyr Glu Arg Thr Val Thr
355 360 365

Val Pro Met Tyr Ser Ser Gln Thr Gln Thr Tyr Gly Ala Ser Arg Leu
370 375 380

Ile Pro Pro Leu Asn Gln Leu Glu Leu Leu Arg Asn Leu Lys Ser Lys
385 390 395 400

Ser Gly Leu Ala Phe Arg Lys Asp Pro Pro Pro Glu Pro Ser Pro Ser 

405 410 415 

Gin Lys Val Ser Leu Lys Asp Arg Val Phe Ser Ser Pro Arg Gly Val 

420 425 430 

Ala Ala Lys Gly Lys Gly Ser Pro Gin Ala Gin Thr Val Arg Arg Ser 
435 440 445 

Pro Ser Ala Asp Gln Ser Leu Glu Asp Ser Pro Ser Lys Val Pro Lys
450 455 460 

Ser Trp Ser Phe Gly Asp Arg Ser Arg Ala Arg Gin Ala Phe Arg He 
465 470 475 480 

Lys Gly Ala Ala Ser Arg Gin Asn Ser Glu Glu Ala Ser Leu Pro Gly 

485 490 495 

Glu Asp He Val Asp Asp Lys Ser Cys Pro Cys Glu Phe Val Thr Glu 

500 505 510 

Asp Leu Thr Pro Gly Leu Lys Val Ser He Arg Ala Val Cys Val Met 
515 520 525 

Arg Phe Leu Val Ser Lys Arg Lys Phe Lys Glu Ser Leu Arg Pro Tyr
530 535 540 



Asp Val Met Asp Val He Glu Gin Tyr Ser Ala Gly His Leu Asp Met 
545 550 555 560 

Leu Ser Arg Ile Lys Ser Leu Gln Ser Arg Val Asp Gln Ile Val Gly

565 570 575

Arg Gly Pro Ala Ile Thr Asp Lys Asp Arg Thr Lys Gly Pro Ala Glu

580 585 590

Ala Glu Leu Pro Glu Asp Pro Ser Met Met Gly Arg Leu Gly Lys Val 
595 600 605 

Glu Lys Gin Val Leu Ser Met Glu Lys Lys Leu Asp Phe Leu Val Asn 
610 615 620 

He Tyr Met Gin Arg Met Gly He Pro Pro Thr Glu Thr Glu Ala Tyr 
625 630 635 640 

Phe Gly Ala Lys Glu Pro Glu Pro Ala Pro Pro Tyr His Ser Pro Glu 

645 650 655 

Asp Ser Arg Glu His Val Asp Arg His Gly Cys He Val Lys He Val 

660 665 670 

Arg Ser Ser Ser Ser Thr Gly Gin Lys Asn Phe Ser Ala Pro Pro Ala 
675 680 685 

Ala Pro Pro Val Gin Cys Pro Pro Ser Thr Ser Trp Gin Pro Gin Ser 
690 695 700 

His Pro Arg Gin Gly His Gly Thr Ser Pro Val Gly Asp His Gly Ser 
705 710 715 720 

Leu Val Arg He Pro Pro Pro Pro Ala His Glu Arg Ser Leu Ser Ala 

725 730 735 

Tyr Gly Gly Gly Asn Arg Ala Ser Met Glu Phe Leu Arg Gin Glu Asp 

740 745 750

Thr Pro Gly Cys Arg Pro Pro Glu Gly Thr Leu Arg Asp Ser Asp Thr 
755 760 765 

Ser He Ser He Pro Ser Val Asp His Glu Glu Leu Glu Arg Ser Phe 
770 775 780 



Ser Gly Phe Ser Ile Ser Gln Ser Lys Glu Asn Leu Asp Ala Leu Asn
785 790 795 800

Ser Cys Tyr Ala Ala Val Ala Pro Cys Ala Lys Val Arg Pro Tyr Ile

805 810 815



Ala Glu Gly Glu Ser Asp Thr Asp Ser Asp Leu Cys Thr Pro Cys Gly

820 825 830



Pro Pro Pro Arg Ser Ala Thr Gly Glu Gly Pro Phe Gly Asp Val Gly 
835 840 845 

Trp Ala Gly Pro Arg Lys 
850 



<210> 74 

<211> 429 

<212> PRT 

<213> Homo sapiens 

<400> 74

Met Val Gin Lys Ser Arg Asn Gly Gly Val Tyr Pro Gly Pro Ser Gly 
1 5 10 15



Glu Lys Lys Leu Lys Val Gly Phe Val Gly Leu Asp Pro Gly Ala Pro 

20 25 30 

Asp Ser Thr Arg Asp Gly Ala Leu Leu He Ala Gly Ser Glu Ala Pro 
35 40 45 

Lys Arg Gly Ser He Leu Ser Lys Pro Arg Ala Gly Gly Ala Gly Ala 
50 55 60 



Gly Lys Pro Pro Lys Arg Asn Ala Phe Tyr Arg Lys Leu Gin Asn Phe 

65 70 75 80 

Leu Tyr Asn Val Leu Glu Arg Pro Arg Gly Trp Ala Phe lie Tyr His 

85 90 95 

Ala Tyr Val Phe Leu Leu Val Phe Ser Cys Leu Val Leu Ser Val Phe 

100 105 110



Ser Thr Ile Lys Glu Tyr Glu Lys Ser Ser Glu Gly Ala Leu Tyr Ile
115 120 125

Leu Glu He Val Thr He Val Val Phe Gly Val Glu Tyr Phe Val Arg 
130 135 140 

He Trp Ala Ala Gly Cys Cys Cys Arg Tyr Arg Gly Trp Arg Gly Arg 
145 150 155 160 

Leu Lys Phe Ala Arg Lys Pro Phe Cys Val He Asp He Met Val Leu 

165 170 175 

He Ala Ser He Ala Val Leu Ala Ala Gly Ser Gin Gly Asn Val Phe 

180 185 190 

Ala Thr Ser Ala Leu Arg Ser Leu Arg Phe Leu Gin He Leu Arg Met 
195 ; 200 205 

He Arg Met Asp Arg Arg Gly Gly Thr Trp Lys Leu Leu Gly Ser Val 
210 215 220 

Val Tyr Ala His Ser Lys Glu Leu Val Thr Ala Trp Tyr lie Gly Phe 
225 230 235 240 

Leu Cys Leu He Leu Ala Ser Phe Leu Val Tyr Leu Ala Glu Lys Gly 

245 250 255 . 

Glu Asn Asp His Phe Asp Thr Tyr Ala Asp Ala Leu Trp Trp Gly Leu 

260 265 270 

He Thr Leu Thr Thr He Gly Tyr Gly Asp Lys Tyr Pro Gin Thr Trp 
275 280 285 

Asn Gly Arg Leu Leu Ala Ala Thr Phe Thr Leu He Gly Val Ser Phe 
290 295 300 

Phe Ala Leu Pro Ala Gly He Leu Gly Ser Gly Phe Ala Leu Lys Val 
305 310 315 320 

Gln Glu Gln His Arg Gln Lys His Phe Glu Lys Arg Arg Asn Pro Ala
325 330 335

Ala Gly Leu Ile Gln Ser Ala Trp Arg Phe Tyr Ala Thr Asn Leu Ser
340 345 350

Arg Thr Asp Leu His Ser Thr Trp Gln Tyr Tyr Glu Arg Thr Val Thr
355 360 365

Val Pro Met Tyr Ser Ser Gln Thr Gln Thr Tyr Gly Ala Ser Arg Leu
370 375 380

Ile Pro Pro Leu Asn Gln Leu Glu Leu Leu Arg Asn Leu Lys Ser Lys
385 390 395 400

Ser Gly Leu Ala Phe Arg Lys Asp Pro Pro Pro Glu Pro Ser Pro Ser
405 410 415

Gln Lys Val Ser Leu Lys Asp Arg Val Phe Ser Ser Pro
420 425



<210> 75 

<211> 854 

<212> PRT 

<213> Homo sapiens 

<400> 75 

Met Val Gln Lys Ser Arg Asn Gly Gly Val Tyr Pro Gly Pro Ser Gly
1 5 10 15

Glu Lys Lys Leu Lys Val Gly Phe Val Gly Leu Asp Pro Gly Ala Pro
20 25 30

Asp Ser Thr Arg Asp Gly Ala Leu Leu Ile Ala Gly Ser Glu Ala Pro
35 40 45

Lys Arg Gly Ser Ile Leu Ser Lys Pro Arg Ala Gly Gly Ala Gly Ala
50 55 60

Gly Lys Pro Pro Lys Arg Asn Ala Phe Tyr Arg Lys Leu Gln Asn Phe
65 70 75 80

Leu Tyr Asn Val Leu Glu Arg Pro Arg Gly Trp Ala Phe Ile Tyr His
85 90 95

Ala Tyr Val Phe Leu Leu Val Phe Ser Cys Leu Val Leu Ser Val Phe
100 105 110

Ser Thr Ile Lys Glu Tyr Glu Lys Ser Ser Glu Gly Ala Leu Tyr Ile
115 120 125

Leu Glu Ile Val Thr Ile Val Val Phe Gly Val Glu Tyr Phe Val Arg
130 135 140

Ile Trp Ala Ala Gly Cys Cys Cys Arg Tyr Arg Gly Trp Arg Gly Arg
145 150 155 160

Leu Lys Phe Ala Arg Lys Pro Phe Cys Val Ile Asp Ile Met Val Leu
165 170 175

Ile Ala Ser Ile Ala Val Leu Ala Ala Gly Ser Gln Gly Asn Val Phe
180 185 190

Ala Thr Ser Ala Leu Arg Ser Leu Arg Phe Leu Gln Ile Leu Arg Met
195 200 205

Ile Arg Met Asp Arg Arg Gly Gly Thr Trp Lys Leu Leu Gly Ser Val
210 215 220

Val Tyr Ala His Ser Lys Glu Leu Val Thr Ala Trp Tyr Ile Gly Phe
225 230 235 240

Leu Cys Leu Ile Leu Ala Ser Phe Leu Val Tyr Leu Ala Glu Lys Gly
245 250 255

Glu Asn Asp His Phe Asp Thr Tyr Ala Asp Ala Leu Trp Trp Gly Leu
260 265 270

Ile Thr Leu Thr Thr Ile Gly Tyr Gly Asp Lys Tyr Pro Gln Thr Trp
275 280 285

Asn Gly Arg Leu Leu Ala Ala Thr Phe Thr Leu Ile Gly Val Ser Phe
290 295 300

Phe Ala Leu Pro Ala Gly Ile Leu Gly Ser Gly Phe Ala Leu Lys Val
305 310 315 320

Gln Glu Gln His Arg Gln Lys His Phe Glu Lys Arg Arg Asn Pro Ala
325 330 335

Ala Gly Leu Ile Gln Ser Ala Trp Arg Phe Tyr Ala Thr Asn Leu Ser
340 345 350

Arg Thr Asp Leu His Ser Thr Trp Gln Tyr Tyr Glu Arg Thr Val Thr
355 360 365

Val Pro Met Tyr Ser Ser Gln Thr Gln Thr Tyr Gly Ala Ser Arg Leu
370 375 380

Ile Pro Pro Leu Asn Gln Leu Glu Leu Leu Arg Asn Leu Lys Ser Lys
385 390 395 400

Ser Gly Leu Ala Phe Arg Lys Asp Pro Pro Pro Glu Pro Ser Pro Ser
405 410 415

Gln Lys Val Ser Leu Lys Asp Arg Val Phe Ser Ser Pro Arg Gly Val
420 425 430

Ala Ala Lys Gly Lys Gly Ser Pro Gln Ala Gln Thr Val Arg Arg Ser
435 440 445

Pro Ser Ala Asp Gln Ser Leu Glu Asp Ser Pro Ser Lys Val Pro Lys
450 455 460

Ser Trp Ser Phe Gly Asp Arg Ser Arg Ala Arg Gln Ala Phe Arg Ile
465 470 475 480

Lys Gly Ala Ala Ser Arg Gln Asn Ser Glu Glu Ala Ser Leu Pro Gly
485 490 495

Glu Asp Ile Val Asp Asp Lys Ser Cys Pro Cys Glu Phe Val Thr Glu
500 505 510

Asp Leu Thr Pro Gly Leu Lys Val Ser Ile Arg Ala Val Cys Val Met
515 520 525

Arg Phe Leu Val Ser Lys Arg Lys Phe Lys Glu Ser Leu Arg Pro Tyr
530 535 540

Asp Val Met Asp Val Ile Glu Gln Tyr Ser Ala Gly His Leu Asp Met
545 550 555 560

Leu Ser Arg Ile Lys Ser Leu Gln Ser Ser Val Asp Gln Ile Val Gly
565 570 575

Arg Gly Pro Ala Ile Thr Asp Lys Asp Arg Thr Lys Gly Pro Ala Glu
580 585 590

Ala Glu Leu Pro Glu Asp Pro Ser Met Met Gly Arg Leu Gly Lys Val
595 600 605

Glu Lys Gln Val Leu Ser Met Glu Lys Lys Leu Asp Phe Leu Val Asn
610 615 620

Ile Tyr Met Gln Arg Met Gly Ile Pro Pro Thr Glu Thr Glu Ala Tyr
625 630 635 640

Phe Gly Ala Lys Glu Pro Glu Pro Ala Pro Pro Tyr His Ser Pro Glu
645 650 655

Asp Ser Arg Glu His Val Asp Arg His Gly Cys Ile Val Lys Ile Val
660 665 670

Arg Ser Ser Ser Ser Thr Gly Gln Lys Asn Phe Ser Ala Pro Pro Ala
675 680 685

Ala Pro Pro Val Gln Cys Pro Pro Ser Thr Ser Trp Gln Pro Gln Ser
690 695 700

His Pro Arg Gln Gly His Gly Thr Ser Pro Val Gly Asp His Gly Ser
705 710 715 720

Leu Val Arg Ile Pro Pro Pro Pro Ala His Glu Arg Ser Leu Ser Ala
725 730 735

Tyr Gly Gly Gly Asn Arg Ala Ser Met Glu Phe Leu Arg Gln Glu Asp
740 745 750

Thr Pro Gly Cys Arg Pro Pro Glu Gly Thr Leu Arg Asp Ser Asp Thr
755 760 765

Ser Ile Ser Ile Pro Ser Val Asp His Glu Glu Leu Glu Arg Ser Phe
770 775 780

Ser Gly Phe Ser Ile Ser Gln Ser Lys Glu Asn Leu Asp Ala Leu Asn
785 790 795 800

Ser Cys Tyr Ala Ala Val Ala Pro Cys Ala Lys Val Arg Pro Tyr Ile
805 810 815

Ala Glu Gly Glu Ser Asp Thr Asp Ser Asp Leu Cys Thr Pro Cys Gly
820 825 830

Pro Pro Pro Arg Ser Ala Thr Gly Glu Gly Pro Phe Gly Asp Val Gly
835 840 845

Trp Ala Gly Pro Arg Lys
850



<210> 76 

<211> 854 

<212> PRT 

<213> Homo sapiens 

<400> 76 

Met Val Gln Lys Ser Arg Asn Gly Gly Val Tyr Pro Gly Pro Ser Gly
1 5 10 15

Glu Lys Lys Leu Lys Val Gly Phe Val Gly Leu Asp Pro Gly Ala Pro
20 25 30

Asp Ser Thr Arg Asp Gly Ala Leu Leu Ile Ala Gly Ser Glu Ala Pro
35 40 45

Lys Arg Gly Ser Ile Leu Ser Lys Pro Arg Ala Gly Gly Ala Gly Ala
50 55 60

Gly Lys Pro Pro Lys Arg Asn Ala Phe Tyr Arg Lys Leu Gln Asn Phe
65 70 75 80



Leu Tyr Asn Val Leu Glu Arg Pro Arg Gly Trp Ala Phe Ile Tyr His
85 90 95

Ala Tyr Val Phe Leu Leu Val Phe Ser Cys Leu Val Leu Ser Val Phe
100 105 110

Ser Thr Ile Lys Glu Tyr Glu Lys Ser Ser Glu Gly Ala Leu Tyr Ile
115 120 125

Leu Glu Ile Val Thr Ile Val Val Phe Gly Val Glu Tyr Phe Val Arg
130 135 140

Ile Trp Ala Ala Gly Cys Cys Cys Arg Tyr Arg Gly Trp Arg Gly Arg
145 150 155 160

Leu Lys Phe Ala Arg Lys Pro Phe Cys Val Ile Asp Ile Met Val Leu
165 170 175

Ile Ala Ser Ile Ala Val Leu Ala Ala Gly Ser Gln Gly Asn Val Phe
180 185 190

Ala Thr Ser Ala Leu Arg Ser Leu Arg Phe Leu Gln Ile Leu Arg Met
195 200 205

Ile Arg Met Asp Arg Arg Gly Gly Thr Trp Lys Leu Leu Gly Ser Val
210 215 220

Val Tyr Ala His Ser Lys Glu Leu Val Thr Ala Trp Tyr Ile Gly Phe
225 230 235 240

Leu Cys Leu Ile Leu Ala Ser Phe Leu Val Tyr Leu Ala Glu Lys Gly
245 250 255

Glu Asn Asp His Phe Asp Thr Tyr Ala Asp Ala Leu Trp Trp Gly Leu
260 265 270

Ile Thr Leu Thr Thr Ile Gly Tyr Gly Asp Lys Tyr Pro Gln Thr Trp
275 280 285

Asn Gly Arg Leu Leu Ala Ala Thr Phe Thr Leu Ile Gly Val Ser Phe
290 295 300

Phe Ala Leu Pro Ala Gly Ile Leu Gly Ser Gly Phe Ala Leu Lys Val
305 310 315 320

Gln Glu Gln His Arg Gln Lys His Phe Glu Lys Arg Arg Asn Pro Ala
325 330 335

Ala Gly Leu Ile Gln Ser Ala Trp Arg Phe Tyr Ala Thr Asn Leu Ser
340 345 350

Arg Thr Asp Leu His Ser Thr Trp Gln Tyr Tyr Glu Arg Thr Val Thr
355 360 365

Val Pro Met Tyr Ser Ser Gln Thr Gln Thr Tyr Gly Ala Ser Arg Leu
370 375 380

Ile Pro Pro Leu Asn Gln Leu Glu Leu Leu Arg Asn Leu Lys Ser Lys
385 390 395 400

Ser Gly Leu Ala Phe Arg Lys Asp Pro Pro Pro Glu Pro Ser Pro Ser
405 410 415

Gln Lys Val Ser Leu Lys Asp Arg Val Phe Ser Ser Pro Arg Gly Val
420 425 430

Ala Ala Lys Gly Lys Gly Ser Pro Gln Ala Gln Thr Val Arg Arg Ser
435 440 445

Pro Ser Ala Asp Gln Ser Leu Glu Asp Ser Pro Ser Lys Val Pro Lys
450 455 460

Ser Trp Ser Phe Gly Asp Arg Ser Arg Ala Arg Gln Ala Phe Arg Ile
465 470 475 480

Lys Gly Ala Ala Ser Arg Gln Asn Ser Glu Glu Ala Ser Leu Pro Gly
485 490 495

Glu Asp Ile Val Asp Asp Lys Ser Cys Pro Cys Glu Phe Val Thr Glu
500 505 510

Asp Leu Thr Pro Gly Leu Lys Val Ser Ile Arg Ala Val Cys Val Met
515 520 525

Arg Phe Leu Val Ser Lys Arg Lys Phe Lys Glu Ser Leu Arg Pro Tyr
530 535 540

Asp Val Met Asp Val Ile Glu Gln Tyr Ser Ala Gly His Leu Asp Met
545 550 555 560

Leu Ser Arg Ile Lys Ser Leu Gln Ser Arg Val Asp Gln Ile Val Gly
565 570 575

Arg Gly Pro Ala Ile Thr Asp Lys Asp Arg Thr Lys Gly Pro Ala Glu
580 585 590

Ala Glu Leu Pro Glu Asp Pro Ser Met Met Gly Arg Leu Gly Lys Val
595 600 605

Glu Lys Gln Val Leu Ser Met Glu Lys Lys Arg Asp Phe Leu Val Asn
610 615 620

Ile Tyr Met Gln Arg Met Gly Ile Pro Pro Thr Glu Thr Glu Ala Tyr
625 630 635 640

Phe Gly Ala Lys Glu Pro Glu Pro Ala Pro Pro Tyr His Ser Pro Glu
645 650 655

Asp Ser Arg Glu His Val Asp Arg His Gly Cys Ile Val Lys Ile Val
660 665 670

Arg Ser Ser Ser Ser Thr Gly Gln Lys Asn Phe Ser Ala Pro Pro Ala
675 680 685

Ala Pro Pro Val Gln Cys Pro Pro Ser Thr Ser Trp Gln Pro Gln Ser
690 695 700

His Pro Arg Gln Gly His Gly Thr Ser Pro Val Gly Asp His Gly Ser
705 710 715 720

Leu Val Arg Ile Pro Pro Pro Pro Ala His Glu Arg Ser Leu Ser Ala
725 730 735

Tyr Gly Gly Gly Asn Arg Ala Ser Met Glu Phe Leu Arg Gln Glu Asp
740 745 750

Thr Pro Gly Cys Arg Pro Pro Glu Gly Thr Leu Arg Asp Ser Asp Thr
755 760 765

Ser Ile Ser Ile Pro Ser Val Asp His Glu Glu Leu Glu Arg Ser Phe
770 775 780

Ser Gly Phe Ser Ile Ser Gln Ser Lys Glu Asn Leu Asp Ala Leu Asn
785 790 795 800

Ser Cys Tyr Ala Ala Val Ala Pro Cys Ala Lys Val Arg Pro Tyr Ile
805 810 815

Ala Glu Gly Glu Ser Asp Thr Asp Ser Asp Leu Cys Thr Pro Cys Gly
820 825 830

Pro Pro Pro Arg Ser Ala Thr Gly Glu Gly Pro Phe Gly Asp Val Gly
835 840 845

Trp Ala Gly Pro Arg Lys
850